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METHODS TO MEDIATE POLYKETIDE SYNTHASE 
MODULE EFFECTIVENESS 

Cross-Reference to Related Applications 

[0001] This application claims the benefit of the filing date of U.S. provisional 
patent application No. 60/361,758, filed March 4, 2002. This application also claims 
priority to U.S. patent application No. 10/091,244, filed March 4, 2002, which claims 
the benefit of the filing date of U.S. patent application No. 09/500,747, filed 
9 February 2000, which, in its turn, claims the benefit of the filing date of U.S. 
provisional application No. 60/119,363, filed 9 February 1999. Furthermore, U.S. 
patent application No. 10/091,244 claims the benefit of the filing date of U.S. 
Provisional Application Nos. 60/272,985 and 60/272,987, both filed 2 March 2001. 
Each of these applications are incorporated herein by reference. 

Statement of Rights to Inventions Made Under Federally Sponsored Research 
[0002] The invention herein was made, at least in part, based on support by grants 
CA-66736, GM-22172, and GM-22176 fi-om the National Institutes of Health, and 
grant BES-9806774 from the National Science Foundation. The U.S. government 
may have certain rights in the invention. 

Technical Field 

[0003] The invention is directed to facilitating usage by polyketide synthase 
modules of nascent polyketide chains. Specifically, the invention concerns including 
intermodule and intramodule linkers in constructions for synthesis of desired 
polyketides. More specifically, the invention concerns the effects of protein-protein 
interactions and enzyme-substrate interactions in the channeling of intermediates 
between polyketide synthase modules. 

Introduction 

[0004] The present invention concerns modular PKS. Modular polyketide 
synthases (PKSs) are multienzyme assemblies responsible for the biosynthesis of 
numerous phannacologically relevant natural products including the antibiotic 
erythromycin and the immunosuppressant FK506. As shown in the schematic 
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diagram of the 6-deoxyerythronolide B synthase (DEBS) in Figures 1 and 23, the 
active sites of these enzymes are organized into distinct modxiles, each of which is 
responsible for elongating the polyketide chain by one ketide unit through the 
coordinated action of the three core active sites - a ketosynthase (KS), an 
acyltransferase (AT), and an acyl carrier protein (ACP). In addition to these three 
core active sites, there are a variable number of postcondensational active sites within 
each module - including a ketoreductase (KR), a dehydratase (DH), and an 
enoylreductase (ER) - that generate structural diversity in the final product. The 
growing polyketide chain is processively elongated as it passes through each of the 
modules in an assembly line fashion such that the number of extensions is dictated by 
the number of modules in the enzyme system. The choices of building blocks made 
by each module and the number and types of domains within each module catalyzing 
postcondensation reactions dictate the chemical ftmctionality at each carbon atom in 
the final product. 

[0005] The unique organization of modular PKSs and the transparency of the 
functional code offer tremendous potential for the use of these enzyme systems as a 
scaffold for the generation of novel small molecules through combinatorial 
biosynthesis. Of all possible strategies for generating new natural product-like 
molecules, the fusion of intact modules from different sources (also referred to as 
"module swapping") presents one of the most appealing methods of generating new 
compounds. According to this strategy, since each module controls the functionality 
and stereochemistry of two adjacent carbon atoms, novel compounds can be generated 
by simply rearranging the order of modules along the assembly line. While there are 
a few examples of successflil use of this strategy (Gokhale, et al., (1999) Science 284, 
482-5; Ranganathan, et al., (1999) Chem Biol 6, 731-41; Wu,et al., (2001) J. Am. 
Chem. Soc. 123, 6465-6474), it is still not clear what factors axe important in 
mediatmg intermodular transfer, and how much of a role each factor plays. 
[0006] Further, the cloning, analysis, and recombinant DNA technology of genes 
that encode PKS enzymes allows one to manipulate a known PKS gene cluster either 
to produce the polyketide synthesized by that PKS at higher levels than occur m 
nature or in hosts that otherwise do not produce the polyketide. The technology also 
allows one to produce molecules that are structurally related to, but distinct firom, the 
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polyketides produced from known PKS gene clusters. See, e.g., PCX publication 
Nos. WO 93/13663; 95/08548; 96/40968; 97/02358; 98/27203; and 98/49315; United 
States Patent Nos. 4,874,748; 5,063,155; 5,098,837; 5,149,639; 5,672,491; 5,712,146; 
5,830,750; and 5,843,718; and Fu, et al, 1994, Biochemistry 33: 9321-9326; 
McDaniel, etal, 1993, Science 262: 1546-1550; and Rohr, 1995,Angew. Chem. Int. 
Ed. Engl 34(8): 881-888, each of which is incorporated herein by reference. 
[0007] PCX publication WO 98/493 1 5, the contents of which are incorporated 
herein by reference, describes an approach for modifying the enzymatic activities 
included within modules of a PKS by maiataining the scaffolding intact but replacing 
catalytic domains with different catalytic domains. U.S. Serial No. 09/346,860 filed 
2 July 1999 and the corresponding PCX publication WO 00/01 838, also filed on that 
date, and incorporated herein by reference describe alternative methods by altering the 
hypeivai iable region of the AX domains so as to alter the specificity for an extender 
unit and alteration of the KS domains to control stereochemistry. Xhe present 
invention takes advantage of the approach of manipulatiag modules so that the 
catalytic activities of an entire module are placed in the appropriate sequence to 
construct a desired polyketide. Xhe ability to utilize this approach depends on 
effecting an appropriate means for the module to incorporate a growing polyketide 
chain, which involves assuring that an appropriate linker region is included. Since the 
filing of the provisional application from which the present application claims 
priority, a related paper has been published by Ranganathan, A., et al, Chem. & Biol 
(1999) 6:731-741. In this paper, intrapolypeptide linkages are fortuitously supplied to 
chimeric modules by including the KS region of the native downstream module in a 
chimera between the corresponding upstream module and the portions downstream of 
the KS domain in a heterologous module. Alternatively, the downstream module will 
include the ACP catalytic domain of the native upstream module fused to the 
remainder of a heterologous module upstream in the chimera. 
[0008] hi PKS polypeptides, the regions that encode enzymatic activities 
(domains) are separated by linker or "scaffold'-encoding regions. These scaffold 
regions encode amino acid sequences that space the domains at the appropriate 
distances and in the correct order. Xhus, the linker regions of a PKS protein 
collectively can be considered to encode a scaffold into which the various domains 
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(and thus modules) are placed in a particular order and spatial arrangement. 
Generally, this organization permits PKS catalytic domains of different or identical 
substrate specificities to be substituted (usually at the DNA level) between PKS 
enzymes by various available methodologies. Thus, there is considerable flexibility 
in the design of new PKS en2ymes with the result that known polyketides can be 
produced more effectively, and novel polyketides useful as pharmaceuticals or for 
other purposes can be made. 

[0009] Linker regions at the N- and C-termini of each polypeptide interface 
(shown as matching tabs in figures 1 and 23) have been previously identified as 
important factors for mediating specific channeling between polypeptides. Consisting 
of approximately 30-90 hypervariable residues, these linker regions have been 
suggested to form coiled-coils and have been shown to interact pairwise and 
specifically with each other (i.e., the C-terminal linker of module 2 interacts 
specifically with the N-terminal linker of module 3, and the C-terminal linker of 
module 4 interacts specifically with the N-terminal linker of module 5) (Gokhale, et 
al., (1999) Science 284, 482-5; Tsuji, et al., (2001) Biochemistry 40, 2317-2325). 
While the importance of these linker regions in mediating intermodular specificity has 
been demonstrated, other interpolypeptide interactions have not been ruled out. The 
most likely candidate for relevant intermodular interactions is the interface between 
the ACP domain of the upstream module and the KS domain of the downstream 
module, since these two active sites are involved in forming the tetrahedral 
intermediate of the trans-thioesterification reaction as the polyketide intermediate is 
channeled from one module to the next. 

[0010] The present invention identifies the role of protein-protein interactions 
between a donor acyl carrier protein (ACP) domain and a downstream ketosynthase 
(KS) domain in the channeling of intermediates between polyketide synthase modules 
and between a polyketide synthase module and a NRPS module. 

Background of the Invention 

[0011] Polyketides are a class of compounds synthesized from 2-carbon units 
through a series of condensations and subsequent modifications. Polyketides occur in 
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many types of organisms, including fungi and mycelial bacteria, in particular, the 
actinomycetes. 

[0012] Polyketides are biologically active molecules with a wide variety of 
structiires, and the class encompasses nmiierous compounds with diverse activities. 
Tetracycline, erythromycin, epothilone, FK-506, FK-520, narbomycin, picromycin, 
rapamycin, spinocyn, and tylosin are examples of polyketides. Given the difficulty in 
producing polyketide compounds by traditional chemical methodology, and the 
typically low production of polyketides in wild-type cells, there has been considerable 
interest in finding improved or alternate means to produce polyketide compounds. 
[0013] The biosynthetic diversity of polyketides is generated by repetitive 
condensations of simple monomers by polyketide synthase (PKS) enzymes that mimic 
fatty acid synthases. For instance, the deoxyerythronolide-B synthase catalyzes the 
chain extension of a primer with several methylmalonyl coenzyme A (MeMalCoA) 
extender units to produce the erythromycin core. 

[0014] The cloning, analysis, and recombinant DNA technology of genes that 
encode PKS enzymes allows one to manipulate a known PKS gene cluster either to 
produce the polyketide synthesized by that PKS at higher levels than occur in nature 
or in hosts that otherwise do not produce the polyketide. The technology also allows 
one to produce molecules that are structurally related to, but distinct from, the 
polyketides produced from known PKS gene clusters. See, e.g., PCT publication 
Nos. WO 93/13663; 95/08548; 96/40968; 97/02358; 98/27203; and 98/49315; United 
States Patent Nos. 4,874,748; 5,063,155; 5,098,837; 5,149,639; 5,672,491; 5,712,146; 
5,830,750; and 5,843,718; and Fu, et al, 1994, Biochemistry 53: 9321-9326; 
McDaniel, etal, 1993, Science 262: 1546-1550; and Rohr, \995, Angew. Chem. Int. 
Ed. Engl. 34(8): 881-888, each of which is incorporated herein by reference. 
[0015] PKSs catalyze the biosynthesis of polyketides through repeated, 
decarboxylative Claisen condensations between acylthioester building blocks. The 
building blocks used to form complex polyketides are typically acylthioesters, such as 
acetyl, butyryl, propionyl, malonyl, hydroxymalonyl, methylmalonyl, and 
ethylmalonyl CoA. 

[0016] Two major types of polyketide synthase (PKS) enzymes are known; these 
differ in their composition and mode of synthesis of the polyketide synthesized. 
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These two major types of PKS enzymes are commonly referred to as Type I or 
"modular" and Type n or "iterative" PKS enzymes. 

[0017] In the Type I or modular PKS enzyme group, a set of separate catalytic 
active sites (each active site is termed a "domain", and a set thereof is termed a 
"module") exists for each cycle of carbon chain elongation and modification in the 
polyketide synthesis pathway. The typical modular PKS is composed of several large 
polypeptides, which can be segregated from amino to carboxy terminii into a loading 
module, multiple extender modules, and a releasing (or thioesterase) domain. The 
PKS enzyme known as 6-deoxyerytlironolide B synthase (DEBS) is a typical Type I 
PKS. In DEBS, there is a loading module, six extender modules, and a thioesterase 
(TE) domain. The loading module, six extender modules, and TE of DEBS are 
present on three separate proteins (designated DEBS-1, DEBS-2, and DEBS-3, with 
two extender modules per protein). Each of the DEBS polypeptides is encoded by a 
separate open reading frame (ORF) or gene; these genes are known as eryAI, eryAII, 
and eryAIII. See Figure 1. There is considerable interest in the genetic and chemical 
reprogramming of modular PKSs (see, e.g., Khosla, 1997, Chem. Rev. P7:2577-2590, 
and Staunton, et al, 1997, Chem. Rev. 261 1-2629, each of which is incorporated 
herein by reference). 

[0018] Generally, the loading module is responsible for binding the first building 
block used to synthesize the polyketide and transferring it to the first extender 
module. The loading module of DEBS consists of an acyltransferase (AT) domain 
and an acyl carrier protein (ACP) domain. Another type of loading module utilizes an 
inactivated KS, an AT, and an ACP. This inactivated KS is in some instances called 
KS*^, where the superscript letter is the abbreviation for the amino acid, glutamine, 
that is present instead of the active site cysteine required for ketosynthase activity. In 
other PKS enzymes, uicluding the FK-520 PKS, the loading module incorporates an 
unusual starter unit and is composed of a CoA ligase activity domain. In any event, 
the loading module recognizes a particular acyl-CoA (usually acetyl or propionyl but 
sometimes butjryl) and transfers it as a thiol ester to the ACP of the loading module. 
[0019] The AT on each of the extender modules recognizes a particular extender- 
CoA (malonyl or alpha-substituted malonyl, i.e., methylmalonyl, ethylmalonyl, and 
hydroxymalonyl) and transfers it to the ACP of that extender module to form a 
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thioester. Each extender module is responsible for accepting a compound from a prior 
module, binding a building block, attaching the building block to the compound from 
the prior module, optionally performing one or more additional functions, and 
transferring the resulting compound to the next module. The transfer into a module is 
mediated by the KS domain which is upstream of the remaining catalytic domains. 
The additional functions are performed by enzymes which comprise a ketoreductase 
(KR) which reduces the carbonyl group generated from the condensation to an 
alcohol, a dehydratase (DH) which converts the alcohol to a double bond, and an 
enoyl reductase (ER) which reduces the double bond to a single bond. These catalytic 
domains appear to be immediately adjacent and not separated by any linking 
sequences. Collectively, they can be called "beta-carbonyl modifying" domains. 
Thus, a particular module may contain none of these activities, only KR, or KR+DH, 
or KR+DH+ER. Thus, the order of domains &om the N-terminus of a particular 
module is KS, AT, beta-carbonyl modifying domains (if present), ACP. The order, 
N-»C of the beta-carbonyl modifying enzymes is DH ER KR. 
[0020] Thus, each extender module of a modular PKS contains zero, one, two, or 
three enzymes that modify the beta-carbon of the growing polyketide chain 
downstream of the AT catalytic domain. A typical (non-loading) mitumal Type I 
PKS extender module is exemplified by extender module 3 of DEBS, which contains 
only a KS domain, an AT domain, and an ACP domain. The next extender module, 
module 4, contaias all three beta-carbonyl modifying enzymes. (The beta-carbonyl 
modifying enzymes effect such modification on tlie extender imit that has been added 
by the previous module.) 

[0021] Once the PKS is primed with acyl- and malonyl-ACPs, the acyl group of 
the loading module migrates to form a thiol ester (trans-esterification) at the KS of the 
first extender module; at this stage, extender module one possesses an acyl-KS 
adjacent to a malonyl (or substituted malonyl) ACP. The acyl group derived fi-om the 
loading module is then covalently attached to the alpha-carbon of the malonyl group 
to form a carbon-carbon bond, driven by concomitant decarboxylation, and generating 
a new acyl- ACP that has a backbone two carbons longer than the loading building 
block (elongation or extension). 
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[0022] After traversing the final extender module, the polyketide encounters a 
releasing domain that cleaves the polyketide ftom the PKS and typically cyclizes the 
polyketide. For example, final synthesis of 6-dEB is regulated by a TE domain 
located at the end of extender module six. In the synthesis of 6-dEB, the TE domain 
catalyzes cyclization of the macrolide ring by formation of an ester linkage. In 
FK-506, FK-520, rapamycin, and similar polyketides, the ester linkage formed by the 
TE activity is replaced by a linkage formed by incorporation of a pipecolate acid 
residue. The enzymatic activity that catalyzes this incorporation for the rapamycin 
enzyme is known as RapP, encoded by the rapP gene. The polyketide can be 
modified further by tailoring enzymes; these enzymes add carbohydrate groups or 
methyl groups, or make other modifications, i.e., oxidation or reduction, on the 
polyketide core molecule. For example, 6-dEB is hydroxylated at C6 and C12 and 
glycosylated at C3 and C5 in the synthesis of erythromycin A. 

Background Information 

[0023] The following articles provide information relating to the invention: 
Aparicio, J. F., et al, (1996) Gene 169, 9-16; Cortes, J., et al, (1990) Nature 348, 
176-178; Donadio, S., et al, (1991) Science 252, 675-679; Gokhale, R. S., et al, 
(2000) Curr. Opin. Chem. Biol 4, 22-27. 

Abbreviations 

[0024] 6-dEB: 6-deoxyerythronolide B; ACP: acyl carrier protein; AT: 
acyltransferase; DEBS: 6-deoxyerythronolide B synthase; DH: dehydratase; ER: 
enoylreductase; KR: ketoreductase; KS: ketosynthase; NAC: N-acetylcysteamine; 
NRPS: nonribosomal peptide synthetase; PCP: peptidyl carrier protein; PKS: 
polyketide synthase; ACP: acyl carrier protein; ER: enoylreductase; LDD: loading 
didomain; TE: thioesterase; M2: module 2 of DEBS; M2(4): module 2 with C- 
terminal linker from module 4; M3+TE: module 3 fused to thioesterase; (5)M3+TE: 
module 3 with N-terminal linker &om modide 5; M2:M3: complex of module 2 and 
module 3; and NDK: (2S,3R)-2-methyl-3-hydroxypentanoic acid diketide. 
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Summary of the Invention 

[0025] The invention is directed to an efficient method for constructing an 
arbitrarily chosen polyketide synthase, and therefore a desired polyketide, by 
manipulating entire modules of Type I polyketide synthases. The invention enables 
this approach by providing the modules with the appropriate "lead-in" or linker 
sequence to the ketosynthase (KS). Applicants have discovered that the appropriate 
linker between modules is required upstream of the relevant KS in order to permit the 
module to accept the nascent polyketide chain, and, in the case of intermolecular 
transfer, appropriate pairing of N-tertninal and C-terminal regions assures the 
appropriate transfer. The nature of this linker varies depending on whether the 
module is covalently linked downstream from another module, or whether it forms the 
N-terminus of the polypeptide. 

[0026] Thus, in one aspect, the invention is directed to a method to construct a 
functional polyketide synthase which method comprises providing each module 
contained in the desired polyketide synthase with an appropriate intrapolypeptide 
linker (RAL) when said module is downstream in the same polypeptide from a 

module derived ftom a different PKS and with an appropriate uiterpolypeptide Imker 
(ERL) when the module is derived from a PKS where the module is the N-terminal 
module of a polypeptide. If the module at the N-terminus of a polypeptide is to 
accept a nascent polyketide chain from an upstream module, the interpolypeptide 
linker needs to include the appropriate amino acid sequence at the C-terminus of the 
module donating the nascent chain. 

[0027] In describing a "module" being provided with linker(s) the term "module" 
refers to the functional portions extending approximately from the N-teiminus of tlae 
KS catalytic region to the C-terminus of the ACP - i.e., excludes the Imker portions 
otherwise considered a portion of the module. 

[0028] As further described below, any order of modules of desired specificity 
can be assured by providing the appropriate linkers either intermolecularly or 
intramolecularly. Thus, the polyketide synthase can be assembled from individual 
modules by providing the appropriate linkers to assure that the polyketide chain will 
be passed in the correct sequence from one module to the next and by assembling 
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these modules either by directly providing the polypeptides containing them or by co- 
expressing nucleotide sequences and coding them in a host cell. 
[0029] In other aspects, the invention is directed to materials and compositions 
useful in carrying out the method, in particular to isolated DNA fragments which 
contain the appropriate intrapolypeptide and interpolypeptide linkers. The invention 
also relates to methods to construct functional polyketide synthases firom libraries of 
modules and to polyketides prepared by supplying appropriate substrates to 
reconstructed polyketide synthases. The polyketides thus prepared can be "tailored" 
using either isolated enzymes or feeding the polyketides to an organism containing 
these enzymes to convert them to anti-infectives or compounds of other activities such 
as motolides by such post-polyketide modifications as hydroxylation and 
glycosylation. The ketides or ketolides or their modified forms can also be further 
derivatized using chemical synthetic methods. 

[0030] In other apects, the invention is directed towards the C- and N-terminal 
ends of adjacent PKS polypeptides capped by peptides of 20-40 residues. 
Mismatched sequences abolish intermodular chain transfer without affecting the 
activity of individual modules, whereas matched sequences can facilitate the 
channeling of intermediates between ordinarily non-consecutive modules. 
[0031] In yet another aspect, the invention is directed towards the role of protein- 
protein interactions between the donor acyl carrier protein (ACP) domain and the 
downstream ketosynthase (KS) domain in various contexts as well as the role of 
linker interactions. Linker interactions and ACP-KS interactions make relatively equal 
contributions at the module 2-module 3 and the module 4-module 5 interfaces in 
DEBS. In contrast, modules 2 and 6 are more tolerant toward substrates presented by 
non-natural ACP domains. This tolerance was exploited for engineering hybrid PKS- 
PKS and PKS-NRPS (non-ribosomal peptide synthetase) junctions and suggests 
fundamental ground rules for engineering novel chimeric PKSs in the future. 
[0032] In yet another aspect, the invention is directed towards the role of protein- 
protein interactions in substrate channeling and more specifically to assays or 
methods to assess the steady-state kinetic parameters of individual DEBS modules 
when primed in a channeling modes versus a diffusive mode. The diffiisive process 
precludes the involvement of the covalent, substrate channeling mechanism by which 
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enzyme-bound intermediates are directly transferred from one module to the next in a 
multi-modular PKS. These methods can be used to quantify the kinetic benefit of 
linker-mediated substrate channeling in a modular PKS. 
[0033] In another aspect, the invention is directed towards the ability of a 
synthetic peptide to inhibit tetrafcetide production. For example, a peptide 
corresponding to the N-terminal linker of module 3 was synthesized and shown to 
inhibit the formation of tetraketide lactone 2 (as shown in Figure 6) in the presence of 
M2 and M3+TE in a concentration-dependent manner. 

[0034] In yet another aspect, the invention is directed towards a method to prepare 
a hybrid modular polyketide synthase (PKS) from individual modules which method 
comprises providing at least a first naturally occurring extender module comprising an 
ACP domain and a second naturally occuning extender module comprising a KS 
domain which is downstream of the ACP domain in a naturally occurring PKS, 
wherein the C-terminus of said ACP domain is covalently linked to the N-tenninus of 
a naturally occurring intrapolypeptide linker (RAL) or interpolypeptide linker (ERL) 
and the N-termmus of said KS domain is covalently linked to the C-terminus of said 
RAL or ERL, and wherein either said first module or second module is not covalently 
linked to said RAL or ERL in a naturally occurring polyketide synthase. 
[0035] In another aspect, the invention is directed towards a method to prepare a 
hybrid modular polyketide synthase (PKS) from individual modules which method 
comprises providing at least a first naturally occurring extender module comprising an 
ACP domain and a second naturally occurring extender module comprising a KS 
domain which is not normally downstream of the ACP domain in a naturally 
occurring PKS, wherein the C-terminus of said ACP domain is covalently linked to 
the N-terminus of a naturally occurring intrapolypeptide linker (RAL) or 
interpolypeptide Imker (ERL) and the N-terminus of said KS domain is covalently 
linked to the C-terminus of said RAL or ERL, and wherein either said first or second 
module is not covalently linked to said RAL or ERL in a naturally occurring 
polyketide synthase. 

[0036] In other aspects, the invention is directed towards a method to prepare a 
hybrid nonribosomal peptide synthetase-modular polyketide synthase (NRPS-PKS) 
from individual modules which method comprises providing at least a first naturally 
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occurring extender module comprising a peptidyl carrier protein (PCP) domain from a 
naturally occurring NRPS and a second naturally occurring extender module 
comprising a KS domain from a PKS, wherein the C-terminus of said PCP domain is 
covalently linked to the N-terminus of a naturally occurring intrapolypeptide linker 
(RAL) or interpolypeptide linker (ERL) and the N-terminus of the KS domain is 
covalently linked to the C-terminus of said RAL or ERL, and wherein either said JBrst 
or second module is not covalently linked to said RAL or ERL in a naturally 
occurring NRPS or PKS. 

Brief Description of the Drawings 

[0037] Figure 1 is a diagram of erj^thromycin PKS which fonns 6-dEB. 
[0038] Figure 2 shows the conversion of a diketide thioester to the triketides 
corresponding to those produced by DEBS-1 of the erythromycin PKS. 
[0039] Figure 3 shows sequences of intrapolypeptide and interpolypetide linkers 
derived from various Type I PKS (SEQ ID NOs: 18-34, in order of appearance). 
[0040] Figure 4 is a schematic diagram of the biosynthesis of 6-dEB by 6- 
deoxyerythronoUde B synthase. 

[0041] Figure 5 presents a velocity vs. NDK (mM) plot showing the fit of the 
Michaelis-Menten equation for MS + TE and (5)M3 + TE alone. 
[0042] Figure 6 is a schematic diagram of the interpolypeptide transfer with 
matched and mismatched linker pairs. 

[0043] Figure 7 provides saturation curves of (A) M2 with MB + TE and (B) 

M2(4) with (5)M3 + TE. 

[0044] Figure 8 shows selective inhibition by a synthetic peptide mimic of the N- 
terminal linker of M3. 

[0045] Figure 9 provides the CD spectrum of a peptide mimic showing the 
minima at 208 and 222 nm indicative of a-helical character. 
[0046] Figure 10 presents schematic illustrations of three mechanisms of loading 
a DEBS module with a diketide 

[0047] Figure 1 1 illustrates four diketides and thek corresponding, putative 
enzymatic products. 
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[0048] Figure 12 presents the reaction schemes of the three bimodular DEBS 
derivatives-Ml+M5+TE (module 1 + module 5 + TE), M1+M6+TE, and M1+M2+TE 
(DEBSl+TE)-and their corresponding kcat values. 

[0049] Figure 13 presents a comparison of the kcat/KM values (min ' mM'* ) of the 
two syn-diketides when presented as acyl-ACP substrates (4a and 4b) vs when 
presented as NAC-thioesters (2a and 2b). 

[0050] Figure 1 4 presents a comparison of the kcat values (min"' ) of the four 
diketides when presented as acyl-ACP substrates (4a-d) vs when presented as NAC- 
thioesters (2a-d). 

[0051] Figure 1 5 presents conditions, results, and proposed mechanisms of back- 
transfer experiments. 

[0052] Figure 1 6 is a schematic of 6-deoxyerythronolide B synthase. 
[0053] Figure 1 7 is a schematic of matched and mismatched linker pairs at 
intermodular interfaces. 

[0054] Figure 1 8 is a schematic illustrating the channeling of intermediates to 
uimatural recipient modules. 

[0055] Figure 1 9 is a schematic illustrating the replacement of the entire donor 
module with just the acyl carrier protein. 

[0056] Figure 20 is a table summarizing the substrates and enzymes used to study 
substrate transfer from a donor to a recipient module. 

[0057] Figure 2 1 shows data supporting a coiled-coil conformation for linkers. 
[0058] Figure 22 is a schematic illustrating how exogenous peptide mimetics of 
linkers can inliibit chain transfer. 

[0059] Figure 23 is a schematic diagram of the biosynthesis of 6-dEB by 6- 
deoxyerythronolide B synthase. 

[0060] Figures 24 A-C depict the modularity of the linker regions in the ACP4- 
module 5 interface. 

[0061] Figure 25 presents an SDS-PAGE image of the purified protein substrates. 
[0062] Figures 26 A-C provide the modularity of the linker regions in the ACP2- 
module 3 interface. 
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[0063] Figures 27 A-D present a schematic diagram and kinetic parameters of the 

four combinations of matched and mismatched linker regions and matched and 

mismatched ACP-KS pairs with module 3 as the acceptor module. 

[0064] Figures 28 A-F present a schematic diagram and kinetic parameters of the 

four combinations of matched and mismatched linker regions and matched and 

mismatched ACP-KS pairs with module 5 as the acceptor module. 

[0065] Figure 29 presents a linker-less ACP4(0) as the donor protein. 

[0066] Figures 30 A-B present a qualitative assessment of the ability of various 

donor proteins to transfer diketide substrates to modules 2 and 6, and a representative 

radio-TLC image of such qualitative assays, respectively. 

[0067] Figures 3 1 A-B represent an alignment of the 6 EryA SU (SEQ ID 

NOs:35-40, respectively). 

Detailed Description of the Drawings 

[0068] Figure 1 is a diagram of the erythromycin PKS which forms 6-dEB, the 
core precursor of erythromycin. As shown, the PKS is comprised of three proteins, , 
DEBS-1, DEBS-2 and DEBS-3 which are encoded by three genes, commonly called 
eryAI, eryAII and eryAUI. 

[0069] Figure 2 shows the conversion of a diketide thioester to the triketides 
corresponding to those produced by DEBS-1 of the erythromycin PKS. 
[0070] Figure 3 shows the structures of intrapolypeptide linkers and the 
N-terminal portions of interpolypeptide linkers (SEQ ID NOs: 1 8-34, in order of 
appearance) derived from various Type I PKS. 

[0071] Figure 4 is a schematic diagram of the biosynthesis of 6-dEB by 6- 
deoxyerythronolide B synthase. Each polypeptide, DEBSl, DEBS2, and DEBS3, 
contams two modules, and each module comprises a set of active site domains 
responsible for addition and modification of an extender unit. The short "linker" 
regions are located at the N- and C-termmi; their shapes exemplify the 
complementarity demonstrated by each pair. 

[0072] Figure 5 presents a velocity vs. NDK (mM) plot showing the fit of the 
Michaelis-Menten equation for M3 + TE and (5)M3 + TE alone. The nearly identical 
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kcat values of 0.71 and 0.68 min"' and Km values of 2.8 and 2.5|jM demonstrate the 
interchangeability of the linkers for individual modules. 

[0073] Figure 6 presents a schematic diagram of the interpolypeptide transfer with 
matched and mismatched linker pairs: (A) wild-type linker pair, M2 C-terminus + M3 
N-terminus; (B) mismatched pair, M2 C-terminus + M5 N-terminus; (C) mismatched 
pair, M4 C-terminus + M3 N-termmus; (D) matched pair, M4 C-terminus + M5 N- 
terminus. 

[0074] Figure 7 provides saturation curves of (A) M2 with M3 + TE and (B) 
M2(4) with (5)M3 + TE showing the effect of increasing M3 concentration on the 
overall rate of turnover. From these plots, the saturating rates of 0.27 and 0.74 min"^ 
were determined, as were the Kd values of 1 . 1 and 2.1 pM . 
[0075] Figure 8 shows selective inhibition by the synthetic peptide mimic of the 
N-terminal linker of M3. ITie linker lowered the overall rate of tetraketide production 
in a dose-dependent fashion for the transfer using the M2-M3 linker pair. However, it 
demonstrated no such effect when tetraketide production depended upon 
interpolypeptide transfer using the M4-M5 linker pair. 

[0076] Figure 9 provides the CD spectrum of the peptide mimic showing the 
minima at 208 and 222 nm indicative of a-helical character. Though only ca. 50% 
helical, its structural features correlate with the expected coiled-coil motif. 
Furthermore, its ability to selectively inliibit interpolypeptide transfer verified some 
recognition ability of the mimic. 

[0077] Figure 1 0 presents schematic illustrations of the three mechanisms of 
loading a DEBS module with a diketide. (A) In a diffusive mechanism, diketides that 
have been activated as N-acetylcysteamine thioesters (diastereomers 2a-d) are loaded 
exogenously onto the KS domam. Claisen-like condensation with a C3-unit derived 
fi:om methylmalonyl CoA followed by NADPH-dependent reduction gives the 
corresponding triketide lactone products (3a-d). (B) In an intrapolypeptide 
channeling mechanism, a diketide that is generated by module 1 from propionyl CoA, 
methylmalonyl CoA, and NADPH is passed intramolecularly fi-om ACPI to KS2. 
Subsequent elongation and reduction afford the triketide lactone 3a. (C) In an 
interpolypeptide channeling mechanism, diketides that have been 
chemoen2ymatically attached to ACP4 by S:^ (4a-d) are transferred to the KS domain 
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on a separate polypeptide. Elongation and reduction afford the corresponding 
triketide lactones (3a-d). In all cases suffix "a" refers to the (2S,3i?) diastereomer, 
suffix "b" refers to the (2R,3S) diastereomer, suffix "c" refers to the (2S,3S) 
diastereomer, and suffix *'d" refers to the {2R,3K) diastereomer. See also Figure 1 1 . 
[0078] Figure 1 1 illustrates the four diketides and their corresponding, putative 
enzymatic products. See also caption to FigurelO. 

[0079] Figure 12 presents the reaction schemes of the three bimodular DEBS 
derivatives-Ml+M5+TE (module 1 + module 5 + TE), M1+M6+TE, and M1+M2+TE 
(DEBSl+TE)-and their corresponding kcat values. 

[0080] Figure 1 3 presents a comparison of the kcat/KM values (min ' mM"' ) of the 
two syn-diketides when presented as acyl-ACP substrates (4a and 4b) vs when 
presented as NAC-thioesters (2a and 2b). The kcat/KM values for the NAC-thioesters 
were reporter earlier by Wu, N., et al., J. Am. Chem. Soc, 2000, 122, 4847-4852. 
[0081] Figure 14 presents a comparison of the kcat values (min ' ) of the four 
diketides when presented as acyl-ACP substrates (4a-d) vs when presented as NAC- 
thioesters (2a-d). 'TSr.D." denotes that the product was not detected. 
[0082] Figure 1 5 presents the following: (A) X-ray fihn image of SDS-PAGE gel 
and associated conditions of back-transfer experiments; and (B) proposed mechanism 
for back transfer of an exogenously loaded diketide from the KS of a formally 
downstream module to an upstream ACP. 

[0083] Figure 1 6 is a schematic of 6-deoxyerythronolide B synthase. 
Intermodular paris are shown in color. LD: loading domain; AT:acyltransferase; 
ACP: acyl carrier protein; KS: ketosynthase; KR: ketoreductase; DH: dehydratase; 
ER: enolyreductase; and TE: thioesterase. 

[0084] Figure 17 is a schematic of matched and mismatched linker pairs at 
intermodular interfaces. Figure 17 shows how linker pairs at intermodular interfaces 
are selective, yet interchangeable. The left hand panel shows how matched linker 
pairs (triangular/triangular or square/square) facilitate efficient chain transfer. The 
right hand panel shows how mismatched linker pairs (triangular/square or 
square/triangular) abolish chain transfer without affecting the activity of individual 
modules. 
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[0085] Figure 1 8 is a schematic illustrating the channeling of intermediates to 
unnatural recipient modules. Linker pairs can chaimel intermediates to unnatural 
recipient modules. The left hand panel shows that in addition to communicating with 
its natural partner, module 3 (as shown in Figure 17), module 2 can also communicate 
with modules 5 or 6 as long as linker pairs are matched. The right hand panel shows 
that such commimication is totally abolished if unmatched linker pairs are used. 
[0086] Figure 1 9 is a schematic illustrating the replacement of the entire donor 
module with just the acyl carrier protein. The properties of linkers as shown in 
Figures 17 and 18 are maintained when the entire donor module is replaced with just 
the acyl carrier protein. 

[0087] Figure 20 is table summarizing the substrates and enzymes used to study 
substrate transfer from a donor to a recipient module. Figure 20 shows how matched 
linker pairs are able to efficiently transfer otherwise poor substrates firom a donor to a 
recipient module. Three different diastereomers of the natural substrate of module 2 
were presented to either module 2 or module 5. These substrates were presented as 
N-acetylcysteamine (NAC) thioesters or on an ACP canying a matched linker. In 
every case chain transfer via protein-protein interactions proved to be a useful method 
for substrate delivery. Products A , B, and C were producted at lower rates than the 
other products. 

[0088] Figure 21 shows data supporting a coiled-coil conformation for linkers. 
Circular dichroism and ultracentriguation on synthetic peptides indicate a coiled-coil 
conformation for linkers. The top panel shows circular dichroism (millidegrees) 
versus wavelength (nm) for Mix, M2C, and FullMSN. The bottom left panel shows 
hiA versus radius^ (mm^) for linkers at module 2 C-terminus. The bottom right panel 
shows InA versus radius^ (mm^) for linkers at module 3 N-terminus. 
[0089] Figure 22 is a schematic illustrating how exogenous peptide mimetics of 
linkers can inhibit chain transfer. This figure suggests that the linkers can adopt a 
functional, native conformation. The top panel shows a natural transfer and the 
bottom panel shows how the addition of an exogenous peptide inhibits the transfer. 
[0090] Figure 23 presents a schematic diagram of the biosynthesis of 6-dEB by 6- 
deoxyer5rthronolide B synthase. Deoxyerythronolide B sjoithase (DEBS) catalyzes 
the biosynthesis of 6-dEB (1), the aglycon precursor of the antibiotic erythromycin. 
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DEBS is composed of three polypeptides - DEBSl, DEBS2, and DEBS3 - each of 
which comprises two modules for a total of six modules (modules 1-6). Individual 
catalj^ic domains are represented by circles, and linker regions are represented by 
solid tabs between DEBSl and DEBS2 and between DEBS2 and DEBS3. Each 
module contains three core catalytic domains - ketosynthase (KS), acyltransferase 
(AT), and acyl carrier protein (ACP) - as well as a variable number of optional 
domains - ketoreductase (KR), dehydratase (DH), and enoylreductase (ER). 
Polyketide biosynthesis is initiated by the action of the loading didomain (LDD) at the 
N-terminus of DEBSl, which primes the synthase with C3-subunit derived from 
propionyl CoA. Biosynthesis of 1 then proceeds in an assembly-line fasMon such that 
the incoming polyketide chain is loaded onto the KS of an extending module from the 
ACP of the previous module. This is followed by a decarboxylative condensation 
reaction between the growing chain and a methylmalonyl-derived C3 extender unit 
that has been loaded onto the ACP by the AT. This C-C bond-forming reaction places 
the groAving chain on the ACP, where it can then undergo unique functionalization 
catalyzed by KR, DH, and ER before being passed to the KS of the downstream 
module. This processive cycle of elongation and functionalization occurs until the 
penultimate intermediate reaches the thioesterase (TE), which catalyzes 
macrocyclization and product release to yield 1. 

[0091] Figure 24 A depicts setup and mechanism for the intermodular transfer and 
elongation assay. Diketide-S-CoA is covalently attached to apo-ACP with the 
phosphopantetheinyl transferase Sfp to yield diketide-ACP. When this substrate is 
added to a module with complementary protein-protein interactions, the diketide is 
transferred to the KS of the acceptor module, where in the presence of methylmalonyl 
CoA extender units it will be elongated one time and cyclized to release the six- 
membered triketide lactone. Figure 24 B demonstrates the reaction of diketide- 
ACP4(4) -1- (5)M5+TE as previously reported (Wu, et al., (2001) J. Am. Chem. Soc. 
123, 6465-6474). In this system, the diketide is channeled from the donor ACP4(4) 
protein to the acceptor (5)M5+TE protein. Figure 24 C presents the reaction of 
diketide-S-N-acetylcysteamine + (5)M5+TE as previously reported (Wu, et al., (2000) 
J. Am. Chem. Soc. 122, 4847-4852). In this reaction, the diketide is diffusively 
loaded onto the acceptor (5)M5+TE protein. 
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[0092] Figure 25 presents an SDS-PAGE image of the purified protein substrates. 
Only proteins which have not been previously reported are show. A protein ladder is 
shown in the left-most lane. Lane A: Diketide-ACP2(2). Lane B: Diketide-ACP2(4). 
Lane C: Diketide-ACP2(0). Lane D: Diketide-ACP4(0). Lane E: Diketide- 
NovH(4). 

[0093] Figures 26 A-C provide the modularity of the linker regions in the ACP2- 
module 3 interface. Figure 26A shows the reaction of diketide-ACP2(2) + (3)M3+TE 
with the natural module 2-moduIe 3 linker pair. Figure 26B shows the reaction of 
diketide-ACP2(4) + (5)M3+TE with the altemate module 4-module 5 linker pair. 
Figure 26C demonstrates a representative mass spectra of the purified diketide-ACP 
substrates showing purity as well as complete conversion from the apo-ACPs. Data 
for diketide-ACP2(2) and diketide-ACP2(4) are shown here. In both cases, the major 
peak corresponds to the parent mass minus 177, indicating loss of N- 
formylmethionine 

[0094] Figures 27 A-D present a schematic diagram and kinetic parameters of the 
four combinations of matched and mismatched linker regions and matched and 
mismatched ACP-KS pairs with module 3 as the acceptor module. Figure 27 A shows 
the reaction of diketide-ACP2(2) + (3)M3+TE with matched linkers and matched 
ACP-KS pairs. Figure 27 B presents the reaction of diketide-ACP2(4) + (3)M3+TE 
with mismatched linkers and matched ACP-KS pairs. Figure 27 C demonstrates the 
reaction of diketide-ACP4(4) + (5)M3+TE with matched linkers and mismatched 
ACP-KS pairs. Figure 27 D depicts the reaction of diketide-ACP4(4) + (3)M3+TE 
with mismatched linkers and mismatched ACP-KS pairs. 

[0095] Figures 28 A-F present a schematic diagram and kinetic parameters of the 
four combinations of matched and mismatched linker regions and matched and 
mismatched ACP-KS pairs with module 5 as the acceptor module. Figure 28 A shows 
the reaction of diketide-ACP4(4) + (5)M5+TE with matched linkers and matched 
ACP-KS pairs.Figure 28 B presents the reaction of diketide-ACP4(4) + (3)M54-TE 
with mismatched linkers and matched ACP-KS pairs. Figure 28 C depicts the reaction 
of diketide-ACP2(4) + (5)M5+TE with matched linkers and mismatched ACP-KS 
pairs. Figure 28 D presents the reaction of diketide-ACP2(2) + (5)M5+TE with 
mismatched linkers and mismatched ACP-KS pairs. Figure 28 E shows a 



19 



wo 03/076581 



PCT/US03/06910 



representative time course used to determine k60DM values. The data here 
corresponds to the reaction of diketide-ACP4(4) + (3)M5+TE. All reactions were 
performed in duplicate to confirm reproducibility. Figure 28 F depicts a representative 
liquid scintillation counting data from the competitive assays used to determine 
kcat/KM values. The data shown here corresponds to the reaction of 2 mM VDK- 
SNAC + 25 DM diketide-ACP4(4) + (3)M5+TE. The peak at 15 minutes 
corresponds to the product derived jfrom VDK-SNAC ((2S, 3R)-2-methyl-3-hydroxy- 
S-(N-acetylcysteamine)-heptanethioate), and the peak at 18 minute corresponds to the 
product derived from diketide-ACP4(4). By measuring the initial slope of a v vs. [S] 
plot, the kcat/KM value for VDK-SNAC + (3)M5+TE was previously determined to 
be 0.078 min-lmM-1 (data not shown). All reactions were performed in duplicate at 
different ratios of competing substrates to confirm reproducibility. 
[0096] Figures 29 A-H present linker-less ACP4(0) as the donor protein. Figure 
29 A corresponds to diketide-ACP4(0) + (5)M5+TE. Figure 29 B corresponds to 
diketide-ACP4(0) + (3)M5+TE. Figure 29 C corresponds to diketide-ACP4(0) + 
(5)M3+TE. Figure 29 D corresponds to diketide-ACP4(0) + (3)M3+TE. Figure 29 E 
corresponds to diketide-ACP4(0) + (5)M2+TE. Figure 29 F corresponds to diketide- 
ACP4(0) + (5)M6+TE. Figure 29 G corresponds to diketide-ACP4(4) + (5)M2+TE, 
shown for reference (Wu, et al., (2001) J. Am. Chem. Soc. 123, 6465-6474). Figure 
29 H corresponds to diketide-ACP4(4) + (5)M6+TE, shown for reference (id.). 
[0097] Figures 30 A-B present a qualitative assessment of the ability of various 
donor proteins to transfer diketide substrates to modules 2 and 6, and a representative 
radio-TLC image of such qualitative assays, respectively. For Figure 30 A, in the 
columns, going left to right: diketide-ACP2(0), diketide-eryLDD(0), diketide- 
NovH(0), diketide-NovH(4). In the rows, going down: (5)M2+TE, (5)M6+TE. For 
Figure 30 B, Jfrom left to right, the lanes correspond to the reactions of diketide- 
ACP2(0), diketide-eryLDD(0), diketide-NovH(0), and diketide-NovH(4) with 
(5)M2+TE. All reactions were performed imder the conditions described in the 
Materials and Methods section. The heavy spots at the baseline correspond to 
methylmalonyl CoA and propionyl CoA (derived firom decarboxylation of 
methylmalonyl CoA) that were adventitiously extracted into the organic layers. The 
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spot at Rf = 0.05 in the diketide-eryLDD(0) reaction was not identified. The 
reactions of the same substrates with (5)M6+TE afforded similar raw data. 
[0098] Figures 3 1 A-B represent alignments of the 6 (SEQ ID NOs:35-40, 
respectively). The symbols on the left margin refer to the particular SU, with S 
referring to AT-S or ACP-S. Numbers on the right margin refer to the aa sequence 
position at the end of each row for EryAI (for 1 , 2 and S on the left), in EryAII (for 3 
and 4) and in Erylll (for 5 and 6). Sequences for EryAI and EryAII-EryAIII are from 
Genbank, accession Nos. N63676 and M63677, respectively. Invariant aa residues in 
the six SU ai-e marked by dashes. Dots refer to computer-introduced gaps to maximize 
alignments. Shaded boxes refer to aa residues invariant in the six (or seven) sequences 
from the US, as well as chicken FAS (Holzer et al, 1989; Yuan et al, 1988), rat FAS 
(Amy et al., 1989) and 6MSAS (Beck et al, 1990). Open boxes refer to conservative 
substitutions or invariant residues in all but one sequence. The N terminus of chicken 
FAS is assumed to precede the published sequence (Holzer et ah, 1989), as recently 
reported (Witkowski et al, 1991a). The KR of SU3, when it deviates from the other 
eight sequences, is ignored for boxing purposes. The extent of each domain is 
indicated by underlining of the sequences with solid black bars, short heavy dashes, 
long heavy dashes, and open bars, representing the KS, AT, KR, and ACP domains, 
respectively. The two arrows mark the extra segments of 152 and 3 15 aa present in 
SU4. The shaded bars under the sequences in the region comprised between the two 
arrows indicate invariant and conservative substitutions among the six SU. 

[0099] Other features and advantages of the invention will be apparent from the 
following detailed description, and from the claims. 

[00100] Nomenclature. The nomenclature used in this report for proteins 
containing linker regions is identical to that used previously (Wu, et al., (2001) J. Am. 
Chem. Soc. 123, 6465-6474; Tsuji, et al., (2001) Biochemistry 40, 2317-2325). 
Specifically, the module of origin of the linker is placed in parentheses either before 
or after the name of the domain or module to which it is attached, depending on 
whether it is anN-or a C-terminal linker, respectively. The boundaries of ACP 
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domains, KS domains, and linkers are defined as before Gokhale, et al, (1999) 
Science 284, 482-5; Tsuji, et al., (2001) Biochemistry 40, 2317-2325). For a protein 
whose linker region has been deleted, a null set symbol (0) is placed in the 
parentheses. Accordingly, module 6 that has been engineered with the N-terminal 
linker from module 5 is represented as (5)M6; likewise, ACP2 with no linker regions 
is represented as ACP2(0). If a thioesterase domain is fused to the C-terminal end of 
a module, it is indicated as such (e.g. (5)M5+TE). 

[00101] Reagents and Chemicals. DL-[2-w2e%/-"^C]Methylmalonyl-CoA (56 
mCi/mmol) was purchased from ARC, Inc. All other chemicals were purchased from 
Sigma-Aldrich. Buffer A: 100 mMNaH2P04, 2.5 mM DTT, 1 mM EDTA, 20% 
glycerol, pH 7.1. Buffer B: 100 mM NaH2P04, 10 mM imidazole, 1 MNaCl, 20% 
glycerol, pH 8.0. Buffer C: 400 mM NaH2P04, 1 mM EDTA, 2.5 mM DTT, 20% 
glycerol, pH 7.1. 

Modes of Carrying Out the Invention 

[00102] The invention takes advantage of the identification of the amino acid 
sequences for supplying an appropriate linker between modules of a Type I PKS 
depending on the position of the module in the synthetic scheme for the polyketide. If 
the module is at the N-terminus of the polypeptide in which it resides - i.e., there is no 
additional module covalently bound upstream to it, an "interpolypeptide linker"(ERL) 
is placed upstream of the KS catalytic domain. Conversely, if the module resides in a 
polypeptide wherein there is an additional module upstream of it and covalently 
linked to it as a fusion protein, the two modules should be separated by an 
"intrapolypeptide linker" (RAL). If the module residing at the N-terminus of a 
polypeptide is downstream in the synthesis process for a polyketide - i.e., if it must 
accept a nascent polypeptide chain fi:om a different module not on the same molecule, 
it may be necessary as well to supply a portion of the interpolypeptide linker at the 
C-terminus of the module providing the nascent polyketide chain in order to assxire 
orderly transfer. 

[00103] In the discussion that follows, polyketide synthases are discussed either at 
the protein level or the DNA level. As is well understood, manipulation of the 
sequence of amino acids in the polyketide synthase proteins is most conveniently 
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done using recombinant techniques. Thus, for example, the appropriate linker 
sequences can be mtroduced to or modified with respect to those of an existing 
module by modifying the appropriate gene and expressing it in a suitable host. 
Interchange of linkers is also conveniently done in this manner. Further, 
modifications of amino acid sequences so as to obtain "variants" are effected by 
mutating the gene. The referent polyketide synthase should be understood to exist at 
both the protein level and nucleic acid level, and which form is being discussed 
should be apparent from the context. 

[00104] Further, the action of polyketide synthases on their appropriate substrates 
can be effected either extracellularly by using isolated enzymes or may be effected by 
producing the enzymes intracellularly. By "appropriate substrate" is meant the 
extender units in their thioester forms that are recognized by the various modules in 
the PKS and "starter" units which are either thioesters of carboxylic acids or partially 
synthesized polyketides such as diketides. For example, as described in PCT 
application PCTAJS96/1 1317, the ketosynthase domain of module 1 may 
conveniently be inactivated thus making more efficient the utilization of the diketide 
by module 2. 

[00105] The linkers can be supplied by conventional recombinant DNA 

manipulations through the use of restriction enzymes and ligation procedures 
commonly practiced. The linkers in the PKS of the invention will be "isolated" from 
their natural environments. By "isolated," as used herein, is meant simply that the 
referent is found linked in association with moieties with which it is not normally 
associated, or in an environment in which it is not naturally found. It may be linked, 
if a nucleotide sequence to additional sequence with which it is not normally linked, 
or, if a peptide, to additional amino acid sequence with which it is not ordinarily 
linked, or it may be simply detached fi:om additional moieties with which it is usually 
associated. 

[00106] As seen firom Figure 3, the intrapeptide linkers (RAL) of the invention 
contain approximately 16-20 amino acid and typically contain a proline residue at 
approximately the middle of the sequence. On the other hand, the N-terminal 
upstream interpolypeptide linkers are approximately twice as long and appear to 
contain conserved acidic amino acid residues and basic amino acid residues at 
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positions in the upstream half of the molecule. Thus, typical N-terminal upstream 
interpolypeptide linker (ERL) will contain an acidic amino acid within the first 3-10 
residues, which is followed after 8-10 residues by a basic amino acid, and then after 
another 2-5 amino acid residues by an additional acidic amino acid. Additional acidic 
and basic residues may also occur in these linkers. 

[00107] The intrapeptide linkers or interpeptide linkers shown in Figure 3 can be 
used as described below in the present invention or the corresponding amino acid 
sequences fix)m native Type I PKS in general can be employed. In addition to the 
sequences that occur in nature, "variants" may be used. These variants are obtained 
by altering the amino acid sequence of the linker in minor ways that do not affect the 
ability of the linker to "feed" the nascent polyketide chain to the module in question. 
Typically, such "variants" are obtained from the native sequences by amino acid 
substitution, deletion or insertion; preferably the substitutions are "conservative" 
substitutions - i.e., an acidic amino acid for a different acidic amino acid, a basic 
amino acid for a different basic amino acid, and the like. Preferably, the variants 
contain no more than three amino acid alterations, preferably only two, and more 
preferably only one. 

[00108] For construction of polyketide synthases which contain more than one 
polypeptide, the appropriate sequence of transfers is assured by matching the 
appropriate C-terminal amino acid sequence of the donating module with the 
appropriate N-terminal amino acid sequence of the interpolypeptide linker of the 
accepting module. This can readily be done, for example, by selecting such pairs as 
they occur in native PKS. For example, two arbitrarily selected modules could be 
coupled using the C-terminal portion of module 4 of DEBS and the N-terminal of 
portion of the linking sequence for module 5 of DEBS. 

[00109] In general, the method of the invention involves supplying to a module 
used m a PKS for synthesis of a desired polyketide with the appropriate N-terminal 
upstream portion interpolypeptide linker (N-ERL), C-terminal downstream portion of 
an interpolypeptide linker (C-ERL) or with an intrapolj^eptide linker (RAL) at either 
terminus. As stated above, if the module is at the N-terminal portion of a polypeptide, 
an N-terminal upstream interpolypeptide linker should be appended at its N-terminus. 
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If the module resides in a polypeptide where there is an additional module fused 
upstream from it, the two modules should be separated by an intrapolypeptide linker. 
[00110] For ease of construction, a library of ftinctional modules can be maintained 
to provide the appropriate desired module for construction of the PKS. One way to 
ensure the appropriate sequence of polyketide chain growth is to link the modules 
covalently, so that all but the first module will contain upstream intrapolypeptide 
linkers. Alternatively, and preferably, appropriate commiraication between functional 
modules non-covalently associated on separate polypeptide molecules can be 
achieved by providing appropriate matching between the C-terminal downstream 
portion of the interpolypeptide linker associated with the module contributing the 
nascent polyketide chain and the N-terminal upstream portion of the interpolypeptide 
linker placed upstream of the module wliich accepts and extends this nascent 
polyketide. Thus, an appropriate linker to ensure that the growing polyketide chain 
will be passed from module A to module B, which modules are not covalently bound, 
would be to couple, for example, the C-terminal scaffold portion of module 4 from 
erythromycm to module A and the N-terminal interpolypeptide linker (scaffold) 
portion from module 5 of the erythromycin PKS to the N-terminus of the KS of 
module B. 

[00111] To design and construct the PKS, one straightforward approach is to utilize 
the existing linker regions of a native PKS, such as erythromycin PKS, and simply to 
"plug in" modules, for example from a library. 

[00112] A library of modules derived from naturally occurring PKS which contains 
modules incorporating all alternative extender mits used in native PKS combined 
with all variants of beta-carbonyl modification is not large. Extender units that are 
incorporated naturally include malonyl-CoA, methylmalonyl-CoA, 
ethylmalonyl-CoA, and hydroxymalonyl-CoA. The appropriate native molecule for 
incorporation of each of these can readily be found. Methylmalonyl-CoA extender 
units are incorporated, for example, by the modules of the erythromycin PKS. Certain 
modules of the picromycin PKS incorporate malonyl-CoA, while modules of the 
epothilone PKS incorporate ethylmalonyl-CoA or hydroxymalonyl-CoA. Modules 
occur naturally which contain the full spectrum of beta-carbonyl modifying activities; 
to the extent it is desirable to couple a particular beta-carbonyl modifying activity 
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with a particular extender specificity, this can be accomplished by altering catalytic 
domains, ;7er se, as described in the above-referenced PCT publication WO 98/49315. 
The complete combination of extender unit choices with all beta-carbonyl 
modification choices is thus only a total of 4x4 or 16 modules. As the KS unit 
determines the stereoselectivity of the module, accommodation can be made for 
various stereoisomeric forms of precursor by adjusting the KS domain in the module 
library. This expands the total number of modules necessary only to 32. An arbitrary 
number of modules can be included in a particular PKS construct, thus also 
determining the length of the polyketide chain and the size of the macrolide product. 
Of course, the macrolide product can be modified, if desired, by the knovm tailoring 
enzymes which convert naturally occurring macrolides to hydroxylated and/or 
glycosylated forms and tlie like. Such modification can be achieved in a variety of 
ways - by chemical modification, by in vitro treatment with appropriate enzymes, or 
by feeding the polyketides to a host organism which contains the appropriate tailoring 
enzymes, as is well understood in the art. 

[00113] To construct the desired PKS, modules are selected from the library and 
provided the appropriate upstream intrapolypeptide or inteipolypeptide linkers. 
Suitable linkers can be selected from the group consisting of those shown in Figure 3, 
the corresponding sequences fi-om any Type I PKS, or can include variants of these 
depicted sequences which are conservative in nature and do not interfere with the 
ability of the linker to permit effective uptake of the nascent polypeptide chain. The 
linkers can be added by standard recombinant techniques to the modules in the 
library, or, the library can be composed of the collection of modules wherein each 
module has been fiutiier manipulated to include either an intrapolypeptide or 
interpolypeptide linker. It may be desirable, for example, to provide each type of 
extender module with an intrapolypeptide linker, including the possibility of retaining 
the linker that is normally associated with it. If the linker is placed at the N-terminus 
of the module, the module is suitable for covalent linking downstream of an additional 
module m a single polypeptide. If the mtrapolypeptide linker is at the C-terminus, 
ordinarily that module will be placed and linked covalently upstream of an additional 
module. In any case, a module may arbitrarily be provided with an intrapolypeptide 
linker (RAL) at either its N- or C-terminus dependmg on where it is ultimately to be 
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placed in the PKS to be constructed or may be provided with the N-terminal 
upstream portion of an interpolypeptide linker (N-ERL) if it is to be placed at the 
N-terminus of a polypeptide in the PKS, or with a C-terminal portion of an 
mterpolypeptide linker (C-ERL) if it is to be placed at the C-temiinus of a polypeptide 
and is intended to transfer a nascent polyketide chain to a subsequent module. 
[00114] The various modules, with appropriate Imkers are then assembled into the 
desired polyketide synthase. As stated above, the construction of the PKS can be 
based on plugging in active portions of modules into an existing linker array The 
assembly can be perfonned by simply mixing the peptides containing the modules or 
may be generated recombinantly from expression constructs in a host cell. The cell 
may provide the appropriate substrates for the PKS, or the substrates may need to be 
provided to the reaction mixture containing the polypeptides or to the cells in which 
they are generated. Depending on the choice of host, provision may need to be made 
for providing these substrates. 

[00115] In this way, the modules can be "mixed and matched" as desired to 
construct a polyketide product jfrom the desired extender units and with the desired 
beta-carbonyl modification, choosing the linkers in accordance with the position of 
the module in a polypeptide, and the number of modules cam be altered as desired. 
[00116] A preferred starter unit for such an assembly of modules is a diketide 
thioester either formed in situ by including a module which contains a loading domain 
to incorporate a starter unit along with an extender unit to attain this resultant, or the 
diketide may be syntliesized independently and used as the substrate for the PKS. 
The synthesized diketide may be supplied as the thioester, such as the 
N-acylcysteamine thioesters. Preparation methods for these thioesters are described 
in the above-referenced U.S. Serial No. 09/346,860 filed 2 My 1999 and the 
corresponding PCT appUcation, as well as U.S. Serial No. (Atty. docket 
No. 30062-20032.00) filed 27 January 2000. 

[00117] Using the techniques of the invention, it is thus possible to manipulate 
entire modules and effect efficient cross-talk so as to assure production of the desired 
macrolide. Such techniques can be used, for example, to alter the structure of 
macrolide anti-infectives by, for example, replacing the module 2 of the erythromycin 
gene cluster with module 2 of the tylosin gene cluster, or replacing the erythromycin 



27 



wo 03/076581 



PCT/US03/06910 



module 6 (along with its thioester sequence) with the corresponding module 6 from 
narbomycin. 

[00118] In addition, 14-membered macrolides could be expanded to become 16- 
membered macrolides by fusing modules 2-3 of the tylosin, spiramycin or niddamycin 
modules 2-3 between modules 1 and 3 of the erythromycin synthase or by adding any 
arbitrarily chosen module from other Type I PKS clusters into the synthase for 
production of erythromycin. Altematively, modules 1-2 of erj^hromycin could be 
deleted and replaced by modules 1-3 of tylosin, spiramycin or niddamycin. 
[00119] In addition, new substituents can be introduced into, for example, PKS 
er5^hromycin or its precursors by replacing the second module of the erythromycin 
PKS with module 5 from tylosin PKS where the substituted module has the enoyl 
reductase catalytic activity inactivated. This results in erjiihromycins substituted with 
an ethyl group at the 10-position. Alternatively, erythromycin module 5 could be 
replaced by the spiramycin module 6 to obtain 5-desmethyl-4-OH erythromycins. 
[00120] Improved forms of FK-506 are obtained by replacing rapamycin 
modules 2-10 with FK-506 modules 2-6, or by replacing rapamycin modules 2-1 1 
with FK-506 modules 2-7 or by replacing rapamycin modules 2-12 with FK-506 
modules 2-8 or by replacing rapamycin modules 1 1-14 with FK-506 modules 7-10. 
Any combination or subset of the above could also be employed. Improved forms of 
FK-520 can be made in a similar manner. An alternative form of rapamycin is 
synthesized by substituting the FK-506/520 module 1 for rapamycin module 1 . 
[00121] The foregoing are merely exemplary of the types of manipulations that 
could be employed. The polyketides, obtained by supplying the appropriate substrates 
either in vitro or in vivo, may then be further modified if desired by hydroxylation, 
glycosylation and the like to obtain desired products. Further, chemical synthetic 
manipulations may also be employed. 

[00122] Some of the resultmg compounds described above could be prepared by 
alternative techniques previously disclosed, for example, in PCT applications 
PCT/US99/22886 or PCT/US99/24483. However, the procedure described above, 
which manipulates entire modules, may result in better yield or more convenient 
synthesis. 
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[00123] In addition to housing six modules, the three polypeptides of DEBS each 
possess short, nonconserved segments of amino acid residues located at the N- and C- 
termini of adjacent polypeptides (shown with complementary symbols in Figure 4 and 
described in Example 6 below). A previous study has discussed the importance of 
keeping these interpolypeptide "linkers" intact during the engineering of chimeric 
PKSs (Gokhale, R.S., et al. (1999) Science 284, 482-485. To dissect the precise role 
of these linkers in mediating intermodular interactions, an in vitro system consisting 
of a donor module and an acceptor module was developed and kinetically 
characterized and is described in Example 6 below. Each of the components of this 
functional linker pair could then be replaced with counterparts from other naturally 
occurring linker pairs. The results of these experiments, shown in Example 6 and 
accompanying figures 4-9, support the understanding that the linker regions outside a 
module's conserved catalytic domains impact its interactions with other modules. In 
addition to their importance in ftinctionally connecting modules, these short 
sequences also play an active role in the protein-protein recognition of modules, 
helping to maintain the selective transfer of intermediates. 
[00124] There are several strategies for rationally manipulatmg polyketide 
stiiicture by engineering DEBS. For example, it has been demonsti-ated tiaat DEBS is 
amenable to the introduction of unnatural side chains at the C13 and Cu positions via 
precursor-directed feeding of diketides (Jacobsen, et al., Science 1997, 277, 367-369; 
Jacobsen, et al, Bioorg. Med. Chem. 1998, 6, 1 171-1 177; Hunziker, et al., 
Tetrahedron Lett. 1999, 40, 635-638), as well as via replacement of loading 
didomains from alternative synthases. See Marsden, et al., Science 1998, 279, 199- 
202). In addition, protein engmeering of DEBS can generate truncated polyketides, 
(Kao, et al, Am. Chem. Soc. 1994, 116, 11612; Cortes, et al.. Science 1995, 268, 
1487-1489; Kao, C. M., et al., J. Am. Chem. Soc. 1995, 117, 9105-9106; Kao, et al.,. 
Am. Chem. Soc. 1996, IIS, 9184) epimerized polyketides (Bohm, et al., Chem Biol 
1998, 5, 407-412; Kao, et al., J. Am. Chem. Soc. 1998, 720, 2478-2479; Holzbaur, et 
al., Chem. Biol 1999, 6, 189-195; Bycroft, et al., Biochem. 2000, 267, 520-526), 
desmetiiyl polyketides (Oliynyk, et al., Ckem. Biol. 1996, 3, 833-839; Ruan, et al., J. 
Bacterial. 1997, 179, 6416-6425; Liu, et al.,Am. Chem. Soc. 1997, 119, 10553- 
10554; Lau, et al.. Biochemistry 1999, 38, 1643-1651), polyketides containing various 
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degrees of modification of the P-keto groups (Donadio, et al.. Science 1991, 252, 675- 
679; Donadio, et al., Proc. Natl. Acad Sci. U.S.A 1993, PO, 71 19-7123; Bedford, et 
al, 1996, 3, 827-831; McDaniel, etal.,^7K. Chem. Soc. 1997, 119, 4309-4310; Kao, 
C. M., et ai.. Am. Chem. Soc. 1997, 119, 11339-11340), and combinations thereof. 
See McDaniel, et al., Proc. Natl. Acad. Sci. U.S.A 1999, 96, 1846-1851. However, 
one approach for generating diversity in polyketides that has been exploited only to a 
limited extent (Gokhale, et al.. Science 1999, 284, 482-485; Ranganathan, et al., 
Chem. Biol. 1999, 6, 731-741) is the fusion of intact modules (or groups thereof) from 
different PKSs to generate chimeric assembly lines. While the application of such a 
strategy takes advantage of the natural catalytic grouping of the modules to produce 
enzymes of improved catalytic effectiveness, two major issues must be addressed to 
rationally implement a modular rearrangement strategy for combinatorial 
biosynthesis. First, the molecular recognition features of individual modules need to 
be deciphered, so that their placement in hybrid PKSs can be restricted to catalytically 
productive contexts. Second, the mechanistic basis for transferring intermediates 
between adjacent modules must be understood, so that intermodular chain transfer can 
efficiently occur between heterologous modules. This report provides new insights 
into the relative importance of both of these issues and their interrelationships in the 
context of a multimodular PKS. 

[00125] The tolerance and specificity of individual modiiles of DEBS have been 
indirectly investigated using a variety of genetic, biochemical, and chemical 
approaches.^^ Recently, it has been possible to express and reconstitute individual 
DEBS modules as intact proteins. See Gokhale, et al.. Science 1999, 284, 482-485. 
This allowed us to directly assess the substrate specificities of four modules of DEBS 
(modules 2, 3, 5, and 6) using a set of N-acetylcysteamine (NAC)-activated diketides 
as potential substrates (2a-d, Figure lOA), Wu, et si.. Am. Chem. Soc. 2000, 122, 
4847-4852. Surprisingly, not only did the substrate specificity profiles of these four ^ 
individual modules turn out to be quite similar, but these profiles also did not 
correlate well to the structures of the natural substrates of individual modules. 
Separately, recent experiments have suggested that short intermodular luiker 
sequences play an important role in the selective transfer of polyketide intermediates 
between modules. See Gokhale, et al.. Science 1999, 284, 482-485; Tsuji, et al.. 
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Biochemistry 2001 . Therefore, we considered it appropriate to reexamine the steady- 
state kinetic parameters of individual DEBS modules, but this time, to pay closer 
attention to possible protein-protein interactions that could be involved in passing a 
substrate from an upstream module to its downstream neighbor. 
[00126] There are two modes by which a substrate can be passed from one module 
to the next. If the two successive modules are on the same polypeptide (such as 
modules 1 and 2 of DEBS), there is an intrapolypeptide chain transfer. On the other 
hand, if the two successive modules are on separate polypeptides (such as modules 2 
and 3 of DEBS), there is an interpolypeptide chain transfer. In either case, 
biosynthetic intermediates undergo direct interthiol transfer between adjacent modules 
such that the intermediates never go into bulk solution. We refer to this property as 
the "physical channeling" of intermediates between modules. 

[00127] Physical channeling (also commonly referred to as substrate channeling) is 
defined as a mechanism in a sequence of reactions in which reaction intermediate is 
transferred from one active site to the downstream active site without equilibrating 
with the bulk solution. See Kirsch, et al., Biochemistry 1999, 58, 8032-8037. 
Physical chaimeling of intermediates can provide kinetic benefits by increasing the 
effective concentration of tihie substrate, protecting labile intermediates from 
unproductive reactions, and precluding enfrance of intermediates into competing 
enzymatic pathways. Furthermore, substrate channeling between two enzymes can 
help overcome product inliibition of the upstream enzyme by funneling the 
intermediate out of the upstream binding pocket and into the downstream binding 
pocket more efficiently. 

[00128] While physical channeling is a necessary outcome of ftindamental 
polyketide biosynthetic mechanisms (Donadio, et al.. Science 1991, 252, 675-679; 
Cortes, et al.. Nature 1990, 34B, 176-178), the kinetic advantage, if any, of channeling 
intermediates between modules has not yet been resolved. To elucidate the issue of 
"kuietic channeling" (which is defined as physical channeling that results in a kinetic 
advantage—as measured by kcat sover a diffusive loading mechanism in which the 
intermediate equilibrates in the bulk phase after release from the upstream active site 
and before loading in the downstream active site) in modular PKSs, two new assay 
systems-one to probe intrapolypeptide fransfers and one to probe interpolypeptide 
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transfers—were devised that would more accurately mimic the transfer of a substrate 
from the acyl carrier protein (ACP) of one module to the ketosynthase (KS) of the 
next. These assays are described in further detail in Example 7 below. In the first 
assay system, the loading didomain and module 1 of DEBS generated in situ the 
natural diketide intermediate ((2S,3J?)-2-methyl-3-hydroxy-pentanoyl-S-ACPi), which 
could then be transferred to alternative downstream modules in a bimodular PKS 
context (Figure lOB). By comparing the kinetic parameters of these hybrid bimodular 
systems to those for elongation of the same diketide that has been supplied 
exogenously to the isolated downstream module (Figure lOA), the kinetic benefit of 
channeling intermediates between covalently linked modules could be evaluated. A 
second assay system was developed using a chemoenzymatic method, through which 
alternative diketides were covalently attached to the phosphopantetheine arms of an 
individually expressed donor ACP domain (Figure IOC). Here, the entire diketide-S- 
ACP adduct (4a-d) is a formal substrate for a recipient module, therefore allowing 
investigation of interpolypeptide channeling. (The linker sequence at the C-terminal 
end of the ACP as previously described, see Tsuji, et al.. Biochemistry 2001, was 
included in this construct.) By attaching different diketides to the same ACP, the 
steady-state kinetic parameters for diketide elongation by individual modules (each 
with a TE domain fused to its C terminus to facilitate turnover) could be measured. 
Both assay systems were used to compare the properties of modules 2, 5, and 6 of 
DEBS, three modules that perform the same chemistry with identical stereocontrol, 
albeit on very different substrates (Figure 4). The results of tliese studies are 
described in Example 7. 

[00129] Understanding the factors that control the specificity of intermodular chain 
transfer is fundamental to the ability to rationally engineer novel polyketide sjoithases 
via module swapping. Among the factors to be considered are small molecule 
substrate specificity as well as protein-protein interactions between the donor and 
acceptor modules. It has been previously shown that while individual modules have 
defined specificities for small molecules, there is considerable tolerance toward less 
favored stereochemical configurations (Wu, et al., (2001) J. Am. Chem. Soc. 123, 
6465-6474.). In addition, 30-90 residue linker regions at the N- and C-termini of the 
bunodular polypeptides of DEBS have been identified and shown to contribute to the 
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specificity of intermodular transfers between two proteins (Gokhale, at al., (1999) 
Science 284, 482-5; Tsuji, et al., (2001) Biochemistry 40, 23 17-2325). While these 
linker regions are potentially powerful tools for enhancing specificity at engineered 
intermodular junctions, it is likely that other protein-protein interactions are involved 
in mediating the specificity of chain transfer. One of the most plausible candidates 
for relevant protein-protein interactions is the interaction between the ACP domain of 
the donor module and the KS domain of the acceptor module. These two domains 
presumably dock together as the substrate is channeled firom the ACP to the KS 
domain via a tetrahedral transition state; therefore, a certam degree of spatial 
proximity can be inferred, suggesting the existence and relevance of additional 
protein-protein interactions at the ACP-KS interface. 

[00130] To evaluate the relative contributions of the linker interactions and the 
donor ACP-acceptor KS interactions, we used the assay system illustrated in figure 24 
B. All donor proteins were loaded with the same (2S, 3i?>2-methyl-3- 
hydroxypentanoyl thioester (hereafter referred to as "diketide"), which is derived 
firom 2 and which has been shown to be a good substrate for DEBS modules 2, 3, 5, 
and 6 (Wu, et al., (2000) J. Am. Chem. Soc. 122, 4847-4852). Kinetic parameters 
relating to the substrate transfer, elongation, and release were measured in the 
presence of different combinations of donor ACP's, acceptor modules, and linkers. 
From these data, a distinct pattern emerged, providing the framework for basic groimd 
rules for engineering novel PKSs by module swapping. 
[00131] Modularity of the linker regions is essential for their use in mediating 
imnatural interactions between modules from different sources. That is, engineering 
of the linker regions onto heterologous protein must be accompanied by a minimal 
kinetic penalty. To assess the modularity of the two linker pairs from DEBS (i.e., the 
linker pair at the module 2-module 3 interface and the linker pair at the module 4- 
module 5 interface), kinetic parameters describii^ the transfer firom ACP2 to modrde 
3 were determined for the two reactions in which each matched linker pair was 
inserted into the module 2-module 3 interface (Example 9). Engineering of the 
heterologous module 4-module 5 linker pair into the module 2-module 3 junction had 
no effect on the maximal rate of transfer and elongation as compared to die natural 
module 2-module 3 linker pairs (fiigure 26); this is consistent with a previous 
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experiment examining the transfer over the same interface but using the full module 2 
donor protein (Tsuji, et al., (2001) Biochemistry 40, 2317-2325). However, replacing 
the natural linker pair with the heterologous linker pair increases the Km for the 
ACP2-module 3 reaction by approximately 7-fold. The contrast between the 
uniformity of the keo^xM term (which approximates the maximal rate) and the 
variability of the Km term in the presence of different linker pairs suggests that 
swapping out the natural module 2-module 3 linker pair for the alternate module 4- 
module 5 linker pair perturbs only the initial association-dissociation of the ACP2 and 
module 3. When using the full module 2 protein, the increase in value upon 
swapping in the ahemate linker pair is a more modest 2-fold, suggesting the more 
significant Km effect when using isolated ACP2 may be an artifact of the truncated 
upstream protein. 

[00132] To identify and quantify the relative contributions of various protem- 
protein interactions involved in mediating substrate channeling, we have replaced the 
linkers on two donor AC? domains (ACP2 and ACP4) as well as corresponding 
acceptor modules in a modified version of the minimal donor ACP system that had 
been previously developed (Wu, et al., (2001) J. Am. Chem. Soc. 123, 6465-6474). In 
two independent data sets using the N-terminal modules 3 and 5 as the acceptor 
modules, baseline kinetics parameters were first measured for reactions comprising 
both matched linkers and consecutive ACP-KS domains (figures 27 A and 28 A). 
When either the linker regions or the donor ACP was swapped such tliat either the 
linker pairs or the ACP-KS domains were now mismatched, comparable attenuation 
of kinetic parameters was observed (figures 27 B-C and 28 B-C), indicating that for 
modules 3 and 5, the ACP-KS interactions and linker interactions contribute 
comparably to the specificity of intermodular chain transfer (Example 10). 
[00133] The reactions of Imkerless ACP4 (i.e., ACP4(0)) with (5)M5+TE and 
(3)M5+TE (figures 29 A-B) demonstrated comparable kinetic parameters to the 
reactions between ACP4 and module 5 comprismg mismatched luikers (figure 28 B) 
(Example 1 1). This indicates that eliminating linker interactions through mismatched 
linkers is kinetically comparable to eliminating linker interactions through physical 
deletion of the region. Furthermore, the kinetic effects observed in the mismatched 
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linker reactions are a result of the elimination of protein-protein interactions rather 
than an artifact of protein engineering. 

[00134] Whereas the KS domains of the N-terminal modules 3 and 5 are specific 
for their natural upstream ACP domains, the KS domains of the C-terminal modules 2 
and 6 are promiscuous towards heterologous upstream ACP domains. ACP4(0) was 
observed to be capable of transferring substrates to both (5)M2+TE and (5)M6+TE, 
despite the absence of matched linker interactions (figure 29 E-F). Kinetic analysis of 
these two reactions indicate that the attenuation of kinetic efficiency and specificity 
compared to the corresponding reactions comprising matched linkers (i.e., ACP4(4) + 
(5)M2+TE and ACP4(4) + (5)M6+TE) can be accounted for entirely by the 
elimination of linker interactions. The dichotomy between N-terminal modules (e.g., 
modules 3 and 5) and C-terminal modules (e.g., modules 2 and 6) of DEBS can 
perhaps be rationalized in the context of their natural positions in the assembly line. 
N-terminal modules such as DEBS modules 3 and 5 naturally accept incoming 
substrates firom an upstream module on a different polypeptide. Therefore, built-in 
specificity for donor ACP domains would be highly advantageous for maintaining 
specific intermodular transfers. On the other hand, C-terminal modules such as 
modules 2, 4, and 6 natiu-ally accept incoming substrates firom covalently attached 
upstream modules, making specificity between the donor ACP and the acceptor 
module less essential. 

[00135] The generality of the tolerance of modules 2 and 6 for unnatural donor 
ACP domains was elaborated using the linkerless, heterologous ACP domains 
ACP2(0) and eryLDD(0) (Example 12). In all tested cases, channeling was observed 
even in the absence of matched linkers and consecutive ACP-KS pairs (figure 30 A). 
A natural extension of these observations is to explore the tolerance of these KS 
domains for peptidyl carrier protein (PCP) domains derived firom nonribosomal 
peptide synthetases (NRPSs). While PCP and ACP domains share similar three- 
dimensional structural folds and are fiinctionally analogous, the homology of PCPs to 
ACPs is generally relatively low (approximately 15-30%), contributing to very 
disparate surface polarities (Crump, et al., (1997) Biochemistiy 36, 6000-6008; 
Weber, et al., (2000) Struct. Fold. Des. 8, 407-418). There are also numerous 
examples of hybrid NRPS-PKS gene clusters in which PCP domains transfer 
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substrates to KS domains. (Schwecke, et al., (1995) Proc Natl Acad Sci USA 92, 
7839-7843; Beyer, et al., (1999) Biochim Biophys Acta 1445, 185-195; Cane, et al., 
(1999) Chem Biol 6, R319-R325; Paitan, et al., (1999) J. Mol. Biol. 286, 465-474; 
Bender, et al., (1999) Microbiol. Mol. Biol. Rev. 63, 266; Quadri, (2000) Mol. 
Microbiol. 38, 1-12; Molnar, et al., (2000) Chem Biol 7, 97-109; Julien, et al., (2000) 
Gene 249, 153-160; Du, et al., (2000) Chem Biol 7, 623-642; Tillett, et al., (2000) 
Chem Biol 7, 753-764; Nishizawa, et al., (2000) J. Biochem. 127, 779-789; Moffitt, et 
al., (2001) FEMS Microbiol. Lett 196, 207-214; Huang, et al., (2001) Microbiology - 
UK 147, 631-642; Schwarzer, et al., (2001) Natiirwissenschaften 88, 93-101). 
[00136] NovH comprises adenylation (A) and peptidyl carrier protein (PCP) 
domains and is involved in the formation of the coumarin ring in the biosynthesis of 
novobiocin. As there are no PKS genes in the novobiocin gene cluster, it is assumed 
tliat this A-PCP didomain does not naturally interact with any PKS proteins during 
novobiocin biosynthesis. While NovH(0) failed to channel substrates to (5)M2+TE 
or (5)M6+TE in the absence of matched linkers (figure 30 A), interaction between 
NovH and modules 2 and 6 could be effected by engineering the C-terminal linker of 
DEBS module 4 onto the end of NovH such that the resulting NovH(4) protein was 
capable of efficiently transferring substrates to modules 2 and 6. Although an 
artificial intrapolypeptide NRPS-PKS interface has previously been created by 
replacing the DEBS loading didomain with the rifamycin synthetase A-PCP loading 
didomain (Pfeifer, et al., (2001) Science 291, 1790-1792), the rifamycin A-PCP 
didomain naturally interacts with PKS domains on the same polypeptide, indicating 
that it may be inherentiy more amenable to engineering into alternate NRPS-PKS 
junctions. In contrast, this experiment with NovH(4) is to our knowledge the first 
example of engineering a functional NRPS-PKS interfece involving an NRPS domain 
that does not naturally interact with any PKS proteins and a PKS domain that does not 
natiirally interact with any NRPS proteins. While this experiment biases the transfer 
reaction by eliminating the small molecule recognition component of a true NRPS- 
PKS transfer, it indicates that the heterologous linker regions are sufficient for 
inducing interaction between two naturally non-interacting proteins and illustrates the 
potential of these linker regions for future engineering of artificial interpolypeptide 
junctions. 
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[00137] The aggregate of these data provides basic ground rules for the 
development of novel polyketide synthases via module replacement. As mentioned 
above, it has been previously demonstrated that linker pairs can be powerful tools for 
creating specificity in artificial interpolypeptide junctions (Gokhale, et al., (1999) 
Science 284, 482-5; Tsuji, et al., (2001) Biochemistry 40, 2317-2325). However, it is 
also essential to consider the origin of the modioles in the engineered junction as well 
as the modules in any competing junctions. Whereas natural interpolypeptide 
junctions comprise a C-terminal module that channels substrates to an N-terminal 
module (represented as C -> N), artificial junctions should be designed to represent 
one of the other three combinations (N N, C -> C, or N -» C) in order to maximize 
specificity in the engineered assembly line. 

[00138] The present invention is fiirther described by the compounds and methods 
described in the following examples. The examples are provided solely to illustrate 
the invention by reference to specific embodiments. These exemplifications, while 
illustrating certain specific aspects of the invention, do not portray the limitations or 
circumscribe the scope of the disclosed invention. 
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Preparation A 
Construction of Single Module Based Systems 

Single Module Gene Constructs 

[00139] Single module constructs from the DEBS gene cluster were prepared for 
modules 2, 3, 5 and 6 as follows. The TE domain is fused to the module to facilitate 
termination. The (M3+TE) gene was prepared from the tri-modular construct 
pKAOBlS (McDaniel, R., et al, Ckem. Biol. (1997) 4:667) having an Me/ site 
engineered at the start of the DEBS-2 gene. Fusion of the TE gene at the end of 
ACP3 was described in connection with the constraction of pCK13 in Cortes, J., 
etal., Science (1995) 268:1487; Kao, CM., et al, J. Am. Chem. Soc. (1995) 117:9105 
and Kao, CM., et al, J. Am. Chem. Soc. (1996) 1 18:91 84, collectively cited below as 
the "Cortes-Kao documents." The Nhel-EcoRI fragment was cloned into pET 21c 
(Novagen) to construct pRSG34. The EcoRI site was used to delete the stop codon of 
the TE domain so that the protein could be overproduced as a carboxy terminal (His)6 
tagged fusion protein. 

[00140] (M5+TE) was constructed by combining the engineered Ndel site from 
pJRJlO (Jacobsen, etal, Biochem (1998) 37:4928) with the ^coi?/ site frompCKlS 
(Cortes-Kao documents). The Nde-EcoRI&agpient was cloned in pET21c to obtain 
the expression plasmid pRSG46. Expression constructs for (M2+TE) and (M6+TE) 
were prepared similarly using an engineered Nhe site immediately upstream of the 
corresponding KS (at position 7570, 5'-GCTAGCGAGCCGATC-3' (SEQ ID N0:1) 
and at position 28710, 5'-GCTAGCGACCCGATC-3' (SEQ ID NO:2)). 
[00141] These constructs were expressed in E. coli BL21 (DE3) along with an 
expression system for sfip phosphopantetheinyl transferase from B. subtilis. The 
co-expression is described by Lambalot, R.H., et al, Chem. Biol (1996) 3:923. For 
the construction of the gene, the Ndel-Hindlll fragment derived from the 
pUC8-jj$7 (Nakano, etal, Mol Gen. Genet. (1992) 232:313) was cloned into pET28 
which has a kanamycin resistance gene to give resultant plasmid pRSG56. The 
resulting proteins were then isolated for use in the reaction mixtures described in the 
Examples below. 
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[00142] In more detail, the expression was induced with 1 mM 
isopropyl-b-D-thiogalactopyranoside, and the cells were harvested by centrifugation 
after 10 hours and resuspended in disruption buffer, 200 mM sodium phosphate 
pH 7.2, 200 mM sodium chloride, 2.5 mM dithiothreitol, 2.5 mM sodium 
ethylenediamine tetra-acetate (EDTA), 1.5 mM benzamidine, 2 mg/L pepstatin and 
leupeptin and 30% v/v glycerol. The cells were lysed by passing through a ftench 
press, and the lysate was collected after centrifiigation. Nucleic acids were 
precipitated with polyethylenimine (0. 1 5%) and removed via centrifugation. The 
supernatant was made 50% (w/v) saturated with ammonium sulfate and precipitated 
overnight. After centrifugation, the pellet containing protein was redissolved in 
buffer A (100 mM sodium phosphate pH 7.2, 2.5 mM DTT, 2 mM EDTA and 20% 
glycerol (v/v)) and stored at -80°C. For chromatography, the buffer was exchanged to 
buffer A + 1 M ammonium sulfate using a gel filtration PDIO (Pharmacia) colunm. 
The resulting sample was loaded on a Butyl Sepharose (Pharmacia) colunm. 
Fractions containing DEBS proteins were pooled and applied on an anion exchange 
column (Resource Q; 6 mL, Pharmacia). Purified protein fractions were pooled and 
concentrated using Amicon centriprepSO. Typical purified protein yields were 
-3-4 mg/liter of culture. Greater than 90% of proteins were phosphopantctheinylated 
in vivo as a resuh of the overexpression of sfp phosphopantetheinyl transferase. 
Although the proteins were expressed as (His)6-tagged proteins, they did not bind to a 
Ni-column under experimental conditions. It is unclear whether this inability to bind 
to aNi-agarose colvunn is due to steric effects or if the (His)6 peptide was lost during 
purification. 

Example 1 

Requirements for Cell-Free Synthesis of Triketides by Individual Modules - 
Identification of Luiker Regions 

[00143] A cell-free system, tested for the ability to convert the cysteamine thioester 
of 2S,3R-2-methyl-3-hydroxypentanoic acid (compound 2 in Figure 2), consisted of 
0.5-10 mM concentration of the thioester, 2.5 mM ^''C-labeled methylmalonyl CoA 
and 100 pmoles of purified protem prepared in Example 1 m a 100 |jJ reaction, hi 
some cases, 1 mM of NADPH was added; this was done in assay mixtures containing 
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(M2+TE), (M5+TE) and (M6+TE) proteins. The protein was, in each case, a single 
module of the DEBS PKS fused to the thioesterase (TE) termination region of 
module 6. 

[00144] The reaction mixtures were quenched and extracted by ethyl acetate and 
separated by thin-layer chromatography (TLC) to discern the formation of the 
triketide ketolactone 3 and triketide lactone 4 (both shown in Figure 2) after 
30 minutes. Results are shown as the first four entries in Table 1 . 



Table 1 

Formation of Triketides by Single Module Constructs 





Construct 


Plasmid 


Triketide Formed 


1. 


M3+TE 


pRSG34 


Yes 


2. 


M5+TE 


pRSG46 


Yes 


3. 


M2+TE 




No 


4. 


M6+TE 




No 


5. 


ERL-M2+TE 


pRSG64 


Yes 


6. 


ERL-M6+TE 


pRSG54 


Yes 


7. 


M1-RAL-M6-TE 


pST96 


Yes 


8. 


M1-RAL-M3-TE 


pST97 


Yes 


9. 


ery Ml-RAL-n/M5+TE 


pSTllO 


Yes 


10. 


{ery Ml-RAL rifM5+ERL)+DEBS-2+DEBS-3 


pST113 


Yes (dEB6) 



[00145] As seen in Table 1 , although the expected triketides were formed from 
(M3+TE) and (M5+TE) (modules which reside at the upstream portion of their 
respective polypeptides), no triketides were formed fi-om (M2+TE) or (M6+TE), 
(modules which reside at the C-terminal portions of their polypeptides). These latter 
results were unexpected since the diketide can be incorporated by module 2 when it is 
supplied as a part of the complete polypeptide DEBS-1 . It was verified that the ACP 
domain was pantetheinylated in modules 2 and 6, and that for (M2+TE), the KS 
domain could not be acylated with radiolabeled diketide. 

Example 2 

Modification of Single Modules with Linker Sequences 



[00146] The constructs for (M2+TE) and (M6+TE) were modified by deleting the 
sequences encoding the amino acids upstream of the KS catalj^ic domain and 



wo 03/076581 



PCT/US03/06910 



substituting the first 39 amino acids from (M5+TE) containing the N-terminal portion 
of the interpolypeptide linker (N-ERL). The relevant constructs were prepared by 
replacing the BsaBI-EcoRI ftagaient in pRSG46 by the corresponding fragment from 
pCK4 to obtain (ISf-ERL-M2+TE), in plasmid pRSG64, or from pJRJlO to obtain 
(N-ERL-M6+TE) in plasmid pRSG54. These constructs yield modules which contain 
the upstream 39 amino acids from module 5. The constructs were expressed in E. coli 
and proteins obtained as described in Preparation A. These proteins were able to 
produce the triketide product from diketide in the cell free system of Example 1, as 
shown in entries 5 and 6 in Table 1 . 

[00147] The various constructs which are successful in converting diketide to 
triketide were then evaluated for the kmetic constants kcat and Km- These resuhs are 
shown in Table 2. As shown in Table 2, the results are quite sunilar for all constructs 
except that the results from module 3 show a several-fold decrease in kcat as compared 
to the other modules. This is evidently due to the absence of beta-carbonyl modifying 
enzymes in module 3 as verified by the fact that removal of NADPH, (which is 
required for the activity of such modules) from the reaction mixture of 
(N-ERL-M6+TE) also results in a lowering of the kcat- 

Table 2 

Proteins kcat X 100 (min"^) Km (mM) 

(N-ERL-M2+TE) 8 ± 0.6 4.6 ± 0.4 

(M3+TE) 1 .5 ± 0.3 4.4 ± 0.4 

(M5+TE) 7.5 ± 0.7 4.7 ± 0.4 

(N-ERL-M6+TE) 9.5 ± 0.6 4.3 ± 0.4 

(N-ERL-M6+TE) 4.5 ± 0.7 4. 1 ± 0.4 

(- NADPH) 

[00148] It is apparent from these results that the presence of the N-terminal 
upstream sequence associated with modules located at the N-terminal portion of the 
polypeptide is essential for permitting a module in this position to incorporate the 
growing polyketide cham. 
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Example 3 

Construction of rMl-RAL-M3+TE) and (M1-RAL-M6+TE) 

[00149] The BsaBI-EcoRI fra^ents containing modules 3 and 6 respectively were 
cloned behind the Ml module which contains the intrapolypeptide linker (RAL) that 
natively resides between Ml and M2. The resulting M1-RAL-M3+TE and 
M1-RAL-M6+TE genes were then excised as PacI-EcoRItcagpients and inserted into 
pCK12 resulting in plasmids pST97 and pST96 respectively. The corresponding 
proteins were produced by transformation into S. coelicolor CH999. The resulting 
strains of S. coelicolor were able to incorporate the diketide thioester into the triketide 
as shown by entries 7 and 8 in Table 1 . (The triketide produced is the ketolactone 3 in 
Figure 2.) 

Example 4 

Additional Intrapolypeptide Mediated Transfer 

[00150] A constiiact wherein the furst module of the DEBS PKS cluster (ery), 
which contains the intrapolypeptide linker of the corresponding M1-M2 polypeptide 
from the erythromycin PKS, is fused to the fifth module of the rifamycin PKS {rif) 
was constructed by replacing the natural sequence at 28024 of rif ACP5 
(5 '-CGCGAC-3 ') with the Spel recognition sequence 5 '-ACTAGT-3 ' . The 
BsaBI-Spel fragment containing rif M5 was excised and replaced the corresponding 
ery Ml-RAL- fragment in pCK12 to obtain plasmid pSTl 10. This plasmid, 
containing ery Ml-RAL-ny^M5+TE was transformed into S. coelicolor CH999 and 
the resulting strain was able to incorporate the diketide into the tiiketide lactone as 
shown by entry 9 in Table 1 . The amount is comparable to that produced in this strain 
ti-ansformed with DEBS-l+TE. 

Example 5 

Construction of Modules for Intermolecular Transfer 

[00151] The Pac/-5^e/ fragment of pSTl 10 was inserted uito a derivative of pCK7 
(Kao, CM., et al, Science (1994) 265:509) which had an Spelsi^ engineered at the 
beginning of the scaffolding sequence at the carboxy terminus of the polypeptide 
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downstream of ACP2. The resulting pSTl 13 construct still contains ery Ml linked to 
rif M5 via the natural intrapolypeptide linker between ery molecules 1 and 2, and also 
now contains rif M5 covalently linked to the downstream C-terminal portion of the 
ERL derived from ery M2. Thus, the complete ERL between the polypeptide 
generated by pSTl 13 and the protein generated by a construct which generates 
DEBS-2 would correspond to the native ERL in the ery PKS - i.e., rif M5 would be 
associated with ery M3 via the natural interpolypeptide linker between ery 
molecules 2 and 3. Co-transformation into S. coelicolor of pST113 along with 
constructs that produce DEBS-2 and DEBS-3 results in the production of 6-dEB, as 
shown by entry 1 0 of Table 1 . 

Example 6 

Conistruction of Modules for Interpolypeptide Transfer With Matched and 
Mismatched Linker Pairs (Ml and M3+TE. M2 and f5W3+TE, M2(4) and M3+TE. 
and M2(4') and fSMS+TE) 

[00152] Reagents and Chemicals. DL-[2-we%/-'V]Methylmalonyl-CoA (56.4 
mCi/mmol) was obtained from ARC, Inc. 

[00153] The N-terminal linker of M3 was synthesized by New England Peptide 
(Fitchburg, MA). The peptide sequence was as follows, M3 N-term: H2N- 
MTDSEKVAEYLRRATLDLRAARQRIRELESD-amide (SEQ ID N0:3). 

[00154] Construction of Plasmids. Plasmid pBP19 contains module 2 of DEBS 
(M2) and is a derivative of pRSG64 (Gokhale, R. S., et al, (1999) Science 284, 482- 
485), where the thioesterase domain was replaced with a Spel-EcdRl fragment 
containing the natural C-terminal linker for module 2 to make pBP19. Plasmid 
pST179 encodes a derivative of M2 containing the C-terminal linker of DEBS module 
4 (M4). The C-terminal linker of M4 was obtained as a Spel-Eco^ fragment by PCR 
using the primers 5'-ACT AGT AGG CTG TTC GCG GCC TCA C-3' (SEQ ID 
N0:4) and 5'-G GGA ATT CAG GTC CTC TCC CCC GC-3' (SEQ ID N0:5) 
(bold sequences complement DEBS sequence). The PCR amplicon was inserted after 
M2 using the engineered sites, yielding pST179. This plasmid, pRSG34, encodes 
module 3 of DEBS (M3) with its own N-terminal linker and with the thioesterase 
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fused to the C-terminus. Its construction has been described previously {id). Plasmid 
pST132 encodes a derivative of M3 + TE, where the natural N-terminal linker of 
pRSG34 has been replaced with the N-terminal linker of module 5 of DEBS (M5). 
This substitution required the replacement of the Ndel-BsaBl fragment of pRSG34 
with the corresponding fragment from pJRJlO (Jacobsen, J. R., et al, (1998) 
Biochemistry 37, 4928-4934). All constructs were cloned into pET-21c (Novagen) 
vectors for expression in Escherichia coli. 

[00155] Strain and Culture Conditions. Expression of the desired proteins was 
achieved by transforming the above plasmids into an engineered strain of E. coli 
BL21(DE3) containing the sfp phosphopantetheinyl transferase gene from Bacillus 
suhtilis (Lambalot, R. H., et al, (1996) Chem. Biol 3, 923-936). The sfp gene 
product was required to posttranslationally modify the acyl carrier protein (ACP) 
domains by phoshopantetheinylating the apo-ACP (Gokhale, R. S., et al,(l 999) 
Science 284, 482-485). Cells containing the expression plasmids were selected with 
carbenicillin and used to inoculate a 10-20 mL LB medium starter culture grown at 
37°C. After 6 h, the cells were pelleted and used to inoculate tvyo 2 L flasks 
containing 1 L of LB medium each. The flasks were shaken at 250 rpm at 37°C until 
the culture optical density at 600 mn (ODeoo) was 0.6. The flasks were placed in a 
water bath to cool the cells to 22 C (ca. 10 min) and then induced with 0.5 mM 
isopropyl P-D-thiogalactopyranoside at an ODeoo = 0.8. Flasks were then stirred at 
22-24°C for 10-14 h. 

[00156] Purification of Proteins. After induction, the cells were harvested via 
centrifiigation and washed in 50 mM Tris (pH 8) and 1 mM 
ethylenediaminetetraacetic acid (EDTA) before being resuspended in disruption 
buffer [200 mM sodium chloride, 200 mM sodium phosphate, 2.5 mM dithiothreitol 
(DTT), 2.5 mM EDTA, 1.5 mM benzamidine, pepstatin and leupeptin (2 mg/L), and 
30% (w/v) glycerol]. The cell suspension was lysed at 1250 psi using a French press 
and then centrifuged. Polyethylenimine was added to the supernatant to 0.15% to 
precipitate nucleic acids. Following the centrifiigation (20 min at 33300g-) to remove 
the nucleic acids, ammonium sulfate was added to the supernatant until a 50% (w/v) 
saturation was achieved and allowed to precipitate for 2-3 h. The pellet following a 
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45 min centrifugation (33300g) was resuspended in buffer A [100 mM sodium 
phosphate (pH 7.2), 2 mM DTT, 1 mM EDTA, and 20% (v/v) glycerol]. The 
resulting suspension was applied in 2.5 mL aliquots to a 9.1 mL gel filtration column 
(PD-10, Pharmacia) equilibrated with buffer B (buffer A + 1 M ammonium sulfate) 
and eluted in 3.5 mL of buffer B. This eluant was applied to a 30 mL hydrophobic- 
interaction column (Butyl-Sepharose 4 FastFlow, Pharmacia) at 1 mL/min. Elution 
was performed at 1 mL/min with stepwise changes in buffer starting from 100% 
buffer B, to 40%, 20%, and 0%. Steps were made when the absorbance at 280 nm 
approached baseline. Fractions were 10 mL, and those containing the protein of 
interest (typically eluted with 0% buffer B) were pooled and applied to an anion- 
exchange colunm (Resource Q, 6 raOL, Pharmacia) at 1 mL/min. A gradient of 0-0.15 
M NaCl in buffer A was run at 1 mL/min for 3 column volumes, followed by a gentle 
gradient of 0.15-0.30 M NaCl at 1 mL/min for 10 colunm volumes. Fractions of 2 
mL were collected, and those containing concentrated protein (typically 0.22-0.25 M 
NaCl) were pooled and fiuther concentrated on Centriprep 50 membranes (50 kDa 
molecular mass cutoff; Amicon) to a concentration of 0. 1-4 mg/mL. Protein 
concentrations were measured via the modified Lowry assay (Sigma) and 
densitometric analysis of SDS-PAGE gels stained with Cooraassie Blue. On the basis 
of the densitometry data, aU proteins were determined to be >90% pure. 

[001571 In Vitro Polyketide Production. Assays of individual modules contained 
l.OuM protein, 1-10 mM iV^acetylcysteamine thioester of the "natural" (2S',3i2)-2- 
methyl-3-hydroxypentanoic acid diketide (NDK) (1), 4 mM NADPH, 440 mM 
sodium phosphate, 1 mM EDTA, 2.5 mM dithiothreitol (DTT), and 20% w/v 
glycerol, pH 7.2, in 80^L. Reactions with M2 or M2(4) included 0.3 mM ^''C- 
methylmalonyl-CoA (specific activity adjusted to 10.4 mCi/mmol), and those with 
M3 + TE or (5)M3 + TE included 0.5 mM "^C-methylmalonyl-CoA (specific activity 
reduced to 1 .1 mCi/mmol). [M2(4) refers to a derivative of M2 in which the C- 
terminal linker has been replaced with its counterpart from module 4, whereas (5)M3 
refers to a derivative of M3 in which the N-terminal linker has been replaced with its 
counterpart from module 5 (see Figure 4).] Both concentrations of methylmalonyl- 
CoA were saturating, but different concentrations were needed due to the disparate 
rates of turnover between the modules with and without the thioesterase. The reaction 
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mixtures were preincubated to 30°C and started with tlie addition of the 
methyhnalonyl-CoA. The reactions were incubated at 30°C, and at three to four time 
points, 20nL aliquots were removed and quenched with ethyl acetate, which was 
extracted twice (450jxL total) to isolate the polyketide products. The ethyl acetate 
layers were pooled, dried in vacuo, and applied to an analytical thin-layer 
chromatography (TLC) plate (Si250F; Baker). The TLC plate was developed using 
50% ethyl acetate in hexanes as the mobile phase (triketide ketolactone 4, Rj= 0.5; 
triketide lactone 3, Rf= 0.4) and then visualized by electronic autoradiography 
(Instantlmager, Packard Instruments). Product formation was quantified by 
comparison with standards of the labeled methylmalonyl-CoA. The identity of 
products 2, 3, and 4 has been vinambiguously established in earlier in vitro enzymatic 
studies (Pieper, R., et ah, (1995) J. Am. Chem. Soc. 117, 1 1373-1 1374; Pieper, R., et 
al, (1997) Biochemistry 36, 1846-1851) and was reconfirmed by chromatographic 
comparison with authentic reference samples. 

[001581 Assays of M2 and M3 + TE contained 1 .OnM M2 and 0.4-4nM M3 + TE, 
7 mM NDK, 0.5 mM ''*C-methylmalonyl-CoA (specific activity reduced to 3.4 
mCi/mmol), 4 mM NADPH, 440 mM sodium phosphate, 1 mM EDTA, 2.5 mM 
DTT, and 20% w/v glycerol, pH 7.2, in 70nL. Assays of M2(4) and (5)M3 + TE were 
identical, except they contained 0.5p.M M2(4), 0.5-5|aM (5)M3 + TE, and 0.4 mM 
''^C-methylmalonyl-CoA (specific activity reduced to 6.2 mCi/mmol). The 
concentration of M2(4) was limiting in order to facilitate its saturation with (5)M3 + 
TE. The reactions were prewarmed at 30°C and initiated by the addition of the 
methylmalonyl-CoA. As described above, 20 nL aliquots were removed at various 
time points and processed. Extracts loaded onto TLC plates were separated using 
either 80% ethyl acetate in hexanes or 5% methanol in dichloromethane, botii of 
which allowed identification of the tetraketide lactone 2 and the triketide lactones 3 
and 4. 

[00159] Inhibition of Tetraketide Production. The ability of the synthetic 
peptide to inhibit the transfer reaction, and thus the production of tetraketide, was 
tested under the same reaction conditions described for the two module coincubations. 
The concentrations of M2 and M3 + TE were both 1.0|aM, and the concentrations of 
M2(4) and (5)M3 + TE were 0.5 and 1.0p,M, respectively. The only difference was 
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the addition of the peptide at concentrations ranging from 1 to 100|jM to the assay 
containing M2 and M3 + TE [or alternatively M2(4) and (5)M3 + TE]. For greater 
accuracy, each inhibition assay was performed side by side with a control lacking 
inhibitor. The effect of the inhibitor was thus determined by dividing the inhibited 
rate by the control rate. 

[00160] Kinetic Analysis. For individual modules, the steady-state turnover 
number was determined from the time course of triketide formation, normalized to the 
concentration of protein. The dependence of the rate on substrate concentration was 
measured by varying the concentration of NDK while maintaining saturating levels of 
NADPH and methylmalonyl-CoA. From these data, the hat and Km were calculated 
by fitting the normalized v versus [S] plots to tlie Michaelis-Menten equation. 
[00161] For tetraketide formation, the rate of production of tetraketide was 
recorded for varying concentrations of M3 + TE [or (5)M3 + TE] at a fixed 
concentration of M2 [or M2(4)] and saturating concentrations of substrates. By fitting 
the rate dependence of tetraketide to a saturation curve, the maximal velocity (vmax) of 
tetraketide production was determined, and was assumed to represent the case where 
every M2 homodimer was productively associated with an M3 + TE homodimer. 
Thus, the affinity of this protein-protein interaction could be calculated from the rate 

v=kjim-Md} ay 

of tetraketide formation, as represented in Equation 1 

([M2], [M3], and [M2-M3] refer to the concentrations of M2, unbound M3 + TE, and 
the M2/M3 + TE complex, respectively). Since [M2-M3] is related to the Kd of M2 
and M3 + TE as shown in Equation 2, 
[M2][M3] ^ 

which can be rearranged to yield Equation 3, 
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where [M2]o = total concentration of M2, the velocity of tetraketide production can be 
defined relative to the Kq: 

where v = jtcat[M2-M3] and Vmax = ^cat[M2]o. Thus, fitting of the v versus [M3] plot 
(which is equivalent to the bound M3 + TE versus free M3 + TE plot used for 
Scatchard analysis) to Equation 4 allowed determination of the for M2 and M3 + 
TE association. 

[00162] CD Spectroscopy. The CD spectrum of the M3 N-terminal peptide was 
recorded in a l-mm path-length cell at a sample concentration of lOOuM in 
phosphate-buffered saline (PBS; 0.15 M KCl, 25 mM phosphate, pH 6.9). 
Measurements were made using an Aviv 62DS spectropolarimeter. Concentration 
was determined by tyrosine absorbance at 275 lun in 8 M guanidine hydrochloride. 

[00163] Kinetic Analysis of Individual Modules. To directly measure the effect 
of linker replacement, individual modules with substituted linkers were kinetically 
characterized using the natural diketide (NDK) as substrate. The only difference 
between the M3 + TE and (5)M3 + TE proteins was that the former contained the N- 
terminal linker of M3j whereas the latter contained the N-terminal linker of M5. As 
shown in Figure 5, the modules displayed very similar kinetics. The calculated 
values were 0.71 ±0.1 and 0.68 ±0.1 min"^ and the Ku values were 2.8 ±1.5 and 2.5 
± l.OuM for M3 + TE and (5)M3 + TE, respectively. Similar experiments were 
performed to compare M2 and M2(4), where the C-terminal linker of M2 had been 
swapped. Since the rate of turnover of NDK to triketide lactone 3 by module 2 
lacking the fused thioesterase domain was low, however, only an approximate ^cat 
could be estimated [0.02 and 0.03 min"^ for M2 and M2(4), respectively]. Thus, 
replacement of either the N-terminal or the C-terminal linker of a module does not 
appear to affect its intrinsic catalytic properties. 
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[00164] Kinetic Analysis of M2-M3 Coincubations. Upon coincubation of M2 
and M3 + TE in the presence of NDK, methylmalonyl-CoA, and NADPH, tetraketide 
lactone 2 was fonned (Figure 6) with a maximiim rate constant of 0.27 ± 0.01 min'^ 
The kcat of M2 with M3 + TE was determined from the saturation curve determined 
using a fixed concentration of M2 and a variable concentration of M3 + TE and 
correlated well with an earlier measurement of 0.23 min"' for formation of the same 
tetraketide by DEBSl and M3 + TE (Pieper, R., et al, (1997) Biochemistry 36, 1846- 
1851). Thus, the M2 protein described here appears to be a viable alternative to 
DEBSl as a donor of the triketide intermediate to M3 + TE. From the saturation 
curve, a iCo of 1 . 1 ± 0. 1 \iM for the M2-M3 + TE interaction could be estimated 
(Figure 7A; see also Materials and Methods section). Again, this value compares well 
with the previously reported value of 2.6|jM for DEBSl and M3 + TE (Gokhale, R. 
S., etal., (1999) Chem. Biol. 6, 117-125). 

[00165] Analogous to the above study, comcubation of M2(4) with (5)M3 + TE 
allowed examination of the effects of the transplanted DEBS2-DEBS3 linker pair 
(Figure 4) on chain transfer between modules 2 and 3. The efficiency of chain 
transfer in the presence of this alternative linker pair (Figure 6D) was established by 
measuring the same parameters associated with synthesis of the tetraketide lactone 2 
[^oat of 0.74 ± 0.06 min"* and a .Kd of 2.1 ± 0.4 ^M (Figure 7B)]. 

[00166] Effects of Mismatched Linker Pairs. In contrast to the above studies 
with M2 and M3 + TE (Figure 6A) or M2(4) and (5)M3 + TE (Figure 6D), both of the 
noncomplementary coincubations [M2 with (5)M3 + TE (Figure 3B) and M2(4) with 
M3 + TE (Figure 6C)] produced the tetraketide lactone 2 at severely compromised 
rates. The highest rate constants that could be measured with either mismatched 
system were in the range of 0.02-0.03 min"^ Given these low rate constants, the 
dependence of rate on protein concentration could not be measured. As might be 
expected, the rate of production of ketolactone 4 increased markedly in both cases 
relative to the matched incubations above (data not shown), since module 3 remained 
active in these mismatched coincubations. 

[00167] Inhibition of Tetraketide Production by a Synthetic Peptide. Sequence 
analysis using the CoilScan program (Lupas, A., et at, (1991) Science 252, 1 162- 
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1 164; Lupas, A., (1996) Methods Enzymol. 266, 513-52) revealed that the N- and C- 
tenninal interpolypeptide linkers of DEBS contained 15-20 residue segments with 
strong propensity to assume a coiled-coil structure. Since the N-terminal linker of 
module 3 is relatively short (31 residues), a peptide corresponding to this sequence 
was synthesized (see Materials and Methods). As shown in Figure 8, this linker could 
inhibit the formation of 2 in the presence of M2 and M3 + TE in a concentration- 
dependent manner. To test the specificity of this peptide inhibitor, a similar assay was 
also conducted in tlie presence of M2(4) and (5)M3 + TE. No inhibitory effect was 
observed at peptide concentrations up to lOOjiM (Figure 8). Furthermore, the peptide 
also showed no inhibitory effect upon individually assayed modules, including M3 -I- 
TE. Thus, isolated peptide linkers appear to be selectively capable of competing for 
their cognate module-bound partners without affecting their individual catalytic 
activities. 

[00168] The N-terminal linker peptide of M3 was analyzed via circular dichroism 
(CD) to assess its a-helical character. As shown in Figure 9, the spectrum shows the 
208 and 222 nm absorbances characteristic of a-helices. The magnitude of these 
peaks allowed us to estimate that the peptide was approximately 50% helical (Chen, 
Y. H., et al, (1974) Biochemistry 13, 3350-3359). The CD of the peptide appeared 
invariant with peptide concentration, salt concentration, and phosphate concentration. 

[00169] Earlier studies suggested the role of structurally intact intermodular linkers 
in facilitating chain transfer between noncovalently associated modules of PKSs 
(Gokhale, R. S., et al, (1999) Science 284, 482-485). Here we have extended and 
elaborated these findings in several significant ways. First, our results have vividly 
demonstrated the selectivity associated with linker-mediated chain transfer (Figure 6). 
Although intermodular chain transfer can occur between modules possessing 
mismatched linker pairs, a serious kinetic penalty is paid. In contrast, heterologous 
matched linker pairs can facilitate chain transfer between modules as efficiently as 
natural pairs. Second, the selectivity associated with linker pair interactions suggests 
that they might play a key role in the transient assembly of functionally paired 
complexes of the three DEBS proteins (Aparicio, J. F., et al, (1994) J. Biol Chem. 
269, 8524-8528; Pieper, R.,e? al, (1995) Nature 378, 263-266). In support of this 
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hypothesis, we have demonstrated that a peptide mimic of the N-terminal linker of 
DEBS2 can inhibit chain transfer mediated by the linker pair at the DEBS1-DEBS2 
interface but not by the linker pair at the DEBS2-DEBS3 interface (Figure 8). Third, 
we show that the N-terminal linker peptide of DEBS2 has significant (ca. 50%) 
helical content (Figure 9). This observation is consistent with secondary structure 
analyses of individual linker sequences, which previously indicated a propensity of 
these sequences to assume coiled-coil conformations? It should be emphasized that, 
in order for a coiled-coil motif to facilitate noncovalent docking of two modules, the 
heterodimer (or heterooligomer) formed by associations between the C-terminal 
linker and N-temiinal linker must be thermodynamically more favorable than either 
homodioier. Although the data shown in Figure 9 show helical content of a linker, it 
does not provide evidence for formation of a coiled-coil structure. Thus, direct 
evidence for the existence of coiled coils at intermodular interfaces, as well as their 
implications for selective chain transfer, remains to be obtained. 
[00170] Earlier studies have demonstrated the importance of two additional 
molecular recognition events in controlling the overall programming and specificity 
of PKSs. First, individual modules can discriminate among alternative incoming 
substrates (Wu, N., et al, (2000) J. Am. Chem. Soc. 122, 4847-4852); this selectivity 
appears to reside within the individual ketosynthase domains (Jacobsen, J. R., et ah, 
(1997) Science 277, 367-369; Chuck, J., et al, (1997) Chem. Biol. 4, 151-166). 
Second, ketosynthase and ACP domains appear to have some degree of mutual 
recognition (Dreier, J., et al, (1999) J. Biol Chem. 274, 25108-251 12; Ranganathan, 
A., et al, (1999) Chem. Biol 6, 731-741). Both of these recognition properties are 
localized within highly conserved and catalytically critical parts of the large PKS 
modules. Here we define and dissect a third element of selectivity. In contrast to the 
previously recognized factors influencing molecular recognition by PKS components, 
linker-mediated intermodular interactions have been localized to short, nonconserved 
regions that lie outside the core modules and have no influence on the intrinsic 
chemistry of the individiial modules. 
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Example 7 

Methods Directed Towards Assessing the Effects of Protein-Protein Interactions and 
Enzvme-Substrate Interactions in the Channeling of Intermediates between Polyketide 
Synthase Modules 

[00171] Construction of Plasmids. The gene encoding ACP4(4) was amplified as 
an Ndel-EcoRl PGR firagment (523 bp) using the primers 5'- 
CCATATGGTGGTCGACCGGCTCG-3' (SEQ ID N0:6) and 5'-GAATTCCTA- 
CAGGTCCTCTCCCCC- 3 '(SEQ ID NO:7) (sequences complementary to DEBS 
shown in bold). The PGR product was cloned into pET28a (Novagen) to yield 
plasmifi pNW8. Plasmid pST157 encodes a bimodular fusion between module 1 of 
DEBSl and module 5 of DEBS3, with the thioesterase domain fused downstream of 
module 5 ("M1+M5+TE"). This fusion, which was engineered by taking advantage 
of the natural, conserved BsdBl sites located at the start of the KS domains of 
modules 2 and 5, also includes the loading didomain of DEBSl. The "linker" 
sequence that covalently bridges the fused modules is the natural sequence between 
modules 1 and 2, as in DEBSl. The fusion junction between module 5 and the 
thioesterase domain is identical to that m plasmid pRSG46.^^ Similarly, plasmid 
pST92 encodes an "M1+M6+TE" bimodular fusion. Its construction, which is 
completely analogous to that of pST157, involves introduction of this bimodular PKS 
gene from pST96, Gokhale, et al. Science 1999, 284, 482-485, as an Ndel-Ecoia into 
pET-21c (Novagen). The construction of genes encoding (5)M2+TE, (3)M3+TE, 
(5)M5+TE, and (5)M6+TE (pRSG64, pRSG64, pRSG46, and pRSG54, respectively) 
have been described previously, id, as well as the construction of a gene encoding 
(5)M3+TE (pST132). See Tsuji, et al.. Biochemistry 2001. 

[00172] Expression and Purification of Proteins. All individual modules were 
expressed and purified as previously described. Wu, et al.. Am. Chem. Soc. 2000, 
122, 4847-4852. The bimodular proteins were expressed as C-terminal Hise-tagged 
fusion proteins, and their expression and purification schemes were identical to those 
previously described for the individual modules (id.), yielding 0.2 mg/L culture of 
purified M1+M5+TE and 1 mg/mL culture of purified M1+M6+TE. ACP4(4) was 
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expressed by transforming pNW8 into E. coli BL21(DE3) cells (Novagen), which 
were then grown in LB at 37 °C to ODeoo) 0.7-0.8. BL21(DE3)/pNW8 was induced 
overnight with 1 mM IPTG at 30 °C. The cells were harvested by centrifligation, 
washed with TE buffer, and then resuspended in disruption buffer (100 mM NaH2P04 
(pH 7.2), 100 mM NaCl, 1 .2 mM DTT, 1 .2 mM EDTA, 0.7 mM benzamidine, 1 mg/L 
pepstatin, 1 mg/mL leupeptin, and 15% glycerol) before lysis by French press. 
Following removal of the cell debris by centrifiigation, the supernatant was treated 
with 0.1% (w/v) PET to remove nucleic acids followed by a 55% (NH4)2S04 precipita- 
tion. The resulting (NH4)2S04 pellet was resuspended in 100 mM NaH2P04 (pH 7.2), 
2.5 mM DTT, 1 mM EDTA, 20% glycerol (buffer A). This suspension was desahed 
on a PD-10 gel filtration colxmrn (Amersham Pharmacia Biotech AB) equilibrated 
with 10 mM imidazole in 50 mM Tris (pH 8.0), 1 M NaCl, 20% glycerol (buffer B), 
and the eluant was loaded at 1 mL/min onto a Flex-column (Kontes) packed with 5 
mL of Ni NTA-Superflow resin (Qiagen) using a peristaltic pump. After being 
washed with 35 mM imidazole m buffer B for ACP4-(4), the Hise-tagged protein was 
eluted from the resui with 90 mM imidazole in buffer B. The appropriate fractions 
were concentrated, and the buffers were exchanged to buffer A + 1 .5 M (ISIH4)2S04 in 
Centriprep 10 spin columns (Amicon). Using an Akta FLPC system (Amersham 
Pharmacia Biotech AB), the concentrated protein was loaded at 1 mL/min onto a XK 
16/20 column packed with 30 mL of Phenyl Sepharose High Performance resin and 
equilibrated with the same buffer. A gradient from 750 mM (NH4)2S04 to 0 mM 
(NH4)2S04 in buffer A was applied which eluted the protein at 0 mM (NH4)2S04. The 
appropriate fractions were concentrated in Centriprep 1 0 spin colunms to jdeld 
approximately 10-15 mg/L of purified protein which was flash frozen and stored at - 
80 °C. The mass of apo-ACP4(4) was confirmed by MALDI-MS (calculated mass: 
20492, observed mass: 20507). (MW - methionine) was also observed. 

[00173] Synthesis of CoAThioesterDiketides. The carboxylic acids of the 
diketides were synthesized as previously described. Harris, et al., J. Chem. Res. (S) 
1998, 6, 283. They include the (2S,3R), (2R.3S), (2R,3R), and (2S,3S) diastereomers 
of 2-methyl-3-hydroxy-pentanoic acid. These carboxylic acids were converted to 
CoA thioesters 5a-d under the following conditions. See Belshaw, et al., Science 
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1999, 486-489; Robertson, et al., J. Am. Chem. Soc 1991, 113, 2722-2729. 
Carboxylic acid (3.4 mg, 26 |imol), CoASH (sodium salt, 1.1 equiv, Sigma), and 
PyBOP (1.5 equiv, Novabiochem) were dissolved in 0.39 mL of THF and 0.39 mL of 
4% K2CO3 and stirred under argon for 40 min. The reaction mixture was diluted to 
up 5 mL with H20 and injected onto a Beckman Ultrasphere Cig HPLC column (250 
X 10 mm) equilibrated with 50 mM NaH2P04 (pH 4.2) in 10% MeOWRiO. Using a 
10 mL/min linear gradient over 30 min to 50 mM NaH2P04 (pH 4.2) in 80% 
MeOH/HaO, the CoA thioesters eluted at 55% MeOH. After removal of the MeOH 
on a rotavap, the product was desalted by reinjection on the same column equilibrated 
with 10% MeOH/H20 followed by elution with 90% MeOH. The product was 
lyophilized and verified by MALDI-MS (theoretical mass: 881.742; observed mass: 
882.191) and ^HNMR (500 MHz) m H2O. 5a: 0.71 (s, 3H), 0.85 (s, 3H), 0.86 (t, 
3H), 1.08 (d, 3H), 1.48 (m, 2H), 2.38 (1, 2H), 2.76 (m, IH), 2.96 (t, 2H), 3.28 (t, 2H), 
3.41 (t, 2H), 3.52 (dd, IH), 3.73 (td, IH), 3.79 (dd, IH), 3.98 (s, IH), 4.20 (t, 2H), 
4.55 (t, IH), 4.79 (m, 2H), 6.13 (d, IH), 8.21 (s, IH), 8.51 (s, IH). 5b: 0.72 (s, 3H), 
0.86 (s, 3H), 0.89 (t, 3H), 1.11 (d, 3H), 1.44 (m, 2H), 2.40 (t, 2H), 2.79 (m, IH), 2.97 
(t, 2H), 3.30 (t, 2H), 3.43 (t, 2H), 3.52 (dd, IH), 3.75 (td, IH), 3.80 (dd, IH), 3.99 (s, 
IH), 4.21 (m, 2H), 4.56 (m, IH), 4.75 (m, IH), 4.80 (m, IH), 6.15 (d, IH), 8.24 (s, 
IH), 8.54 (s, IH). 5c: 0.66 (s, 3H), 0.78 (s, 3H), 0.78 (t, 3H), 0.97 (d, 3H), 1.26 (m, 
IH), 1.48 (m, IH), 2.31 (t, 2H), 2.71 (m, IH), 2.89 (m, 2H), 3.22 (m, 2H), 3.34 (m, 
2H), 3.45 (dd, IH), 3.59 (dt, IH), 3.72 (dd, IH), 3.90 (s, IH), 4.14 (m, 2H), 4.49 (m, 
IH), 4.73 (m, IH), 4.84 (m, IH), 6.09 (d, IH), 8.25 (s, IH), 8.50 (s, IH). 5d: 0.66 (s, 
3H), 0.78 (s, 3H), 0.78 (t, 3H), 0.97 (d, 3H), 1.27 (m, IH), 1.48 (m, IH), 2.31 (t, 2H), 
2.71 (m, IH), 2.89 (m, 2H), 3.22 (m, 2H), 3.34 (t, 2H), 3.46 (dd, IH), 3.59 (m, IH), 
3.73 (dd, IH), 3.90 (s, IH), 4.14 (m, 2H), 4.49 (m, IH), 4.75 (m, IH), 4.82 (m, IH), 
6.09 (d, IH), 8.25 (s, IH), 8.50 (s, IH). Concentrations of solutions of CoA thioesters 
were determined by A260 measurement and calibration against known CoA 
concentration standards. Yield: 9.6 |jmol (37%). 

[00174] Formation of Holo-ACP and Acyl-ACP from Apo-ACP. The 
phosphopantetheinylation reactions were catalyzed by the S:^ phos-phopantetheine 
transferase, see Quadri, et al.. Biochemistry 1998, 37, 1585-1595; Weinreb, et al.. 
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Biochemistry 1998, 31, 1575-1584, under the following conditions: 150 pM apo ACP, 
4 equiv CoASH (lithiimi salt, Sigma) or acyl-CoA 5a-d, 0.2 equiv S:^ in 100 mM 
NaH2P04 (pH 6.6), 10 mM MgCh, 2.5 mM DTT, 20% glycerol at 37 °C for 20 min. 
Excess small molecules and Sfp were removed from the phosphopantetheinlyated 
ACPs by applying the reaction mixture with an Akta FPLC system to a 6 mLResource 
Q column (Amersham Pharmacia Biotech AB) and eluting with a linear gradient from 
0 mM NaCl to 500 mM NaCl m buffer A. The desired proteins eluted at 220 mM 
NaCl and were concentrated using Centriprep 10 spin columns. Protein 
concentrations were determined using a modified Lowry assay (Sigma), and the 
masses were confiimed by MALDI-MS or + ESI-MS (4a: observed mass = 20945, 
calculated mass = 20944; 4b: observed mass = 20964; 4c: observed mass = 21056; 
4d: observed mass = 20992). 

[00175] Qualitative Substrate Incorporation Assays. The reaction buffer for the 
diketide incorporation assays contained 400 mM NaH2P04 (pH 7.2), 2.5 mM DTT, 1 
mM ETDA, 20% glyercol (reaction buffer C). 1 yM module, 20 nM acyl-ACP, 500 
HMDL-[2-^'^C]-methyhnalonyl CoA (ARC), and 4 mMNADPH (Sigma) were 
incubated in 20 |xL of the reaction buffer at 30 °C for 1 .5 h. The reactions were either 
spotted directly onto a TLC plate (Whatman 250 \iM silica gel, UV254), or first 
extracted with EtOAc followed by spotting of the organic extracts on a TLC plate. 
The TLC plates were resolved using 60% EtOAc in hexanes, and the radioactive 
products were visualized on a Packard Instantlmager. 

[00176] Verification of Reaction Products. Triketide lactone products 3a and 3b 
derived from 2a (or 4a) and 2b (or 4b), respectively, have been previously verified.26 
To verify the triketide lactone products 3c and 3d, reaction extracts were purified by 
preparative TLC. The ethyl acetate extracts of the spots corresponding to the triketide 
lactones were concentrated and then derivatized to TMS ethers by incubation with 50 
|j.L o/2V,0-bis-(trimethylsilyl) trifluoroacetamide (Aldrich) for 30 min at room 
temperature. See McPherson, et al., J. Am. Chem. Soc. 1998, 120, 3267-3268. 
Injection of the sample onto a GC-MS yielded fragmentation peaks at molecular 
weights 73 and 171, corresponding to cleavage between the oxygen and silicon atoms, 
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as expected. Mass spectral confirmation data of the p-ketolactone equivalents of 3c 
and 3d were obtained sans derivatization and by ESI-MS. The elution pattern of the 
triketide lactones from a chiral HPLC column is described below. 

[00177] Determination of kcat Values. The assays for kinetic measurements were 
performed in reaction buffer C and with the same concentrations of NADPH and ''^C- 
methylmalonyl CoA as for the qualitative assays. Saturating concentrations of 
propionyl-CoA were added to and the ACP substrates were excluded from the 
bimodular reactions. To quench the reactions, 20 |j,L reaction aliquots were mixed 
with 80 |aL of 12.5% SDS. kcat values for the acyl-ACP substrates were determined 
by measuring steady-state saturating rates at multiple substrate concentrations 
(varying from 40 to 90 \jM 4). For reactions that did not saturate by 90 IM of 
substrate, the kcat values are reported as lower limits. Workup and visualization of 
the reaction products were identical to those for the qualitative assays. 

[00178] Determination of kcat/KM values. The assays for determination of 
(kBat/KM)rei Were performed with two competing substrates in the same reaction under 
the same conditions as described above for the qualitative assays, except the reaction 

volumes were doubled to 40 |xL. The data were fit into the equation where Sa and Sb 
are the two competing substrates and Pa and Pb are the corresponding products 
derived from Sa and Sb, respectively. The unknown, absolute kcat/KM values could 
then be obtained from known, absolute kcat/KM data that had been derived directly 
from the initial slopes of v versus [S] plots. See McPherson, et al., /. Am. Chem. Soc. 
1998, 12Q, 3267-3268. Each reaction was done in duplicate at two different ratios of 
subsfrate concenteations. The reactions were quenched with 120 jiL of 12.5% SDS, 
and the products were exfracted with 2 x 300 (xL of EtOAc. The organic exfracts 
were purged of highly polar compounds as well as particulates by flash 
chxomatography through 50 pL of silica gel in a 1-mL polypropylene pipet attached to 
a 3-mm, 0.22-pn nylon syringe filter (Osmonics, hic), eluting with 1.5 mL of 
EtOAc. Following removal of the organic solvents, the residual extracts were 
resuspended in 20 |j.L of hexane and loaded onto a 250 x 4.6 mm Chiralpak AS 
column with the corresponding guard column (Daicel Chemical Industries) that had 
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been equilibrated with 5% EtOH (Reagent Alcohol, Fischer) in hexane. With a flow 
rate of 0.8 xnL/xmn, the products were separated using a 20 min gradient (starting at 2 
min) from 5 to 1 5% EtOH in hexane. The reduced triketide lactone products 3a-d 
eluted at 20.0, 17.0, 21.5, and 18.5 min, respectively. The unreduced triketide lactone 
products, derived from 4c and 4d, eluted at 21.0 and 19.0 min, respectively. The 
appropriate fractions were collected, and the radio-active products were detected and 
quantified using Fonnula-989 liquid scintillation cocktail fluid (Packard) on a 
Beckman LS3801 liquid scintillation counter. 

[00179] Labeling of Holo-ACP4(4) with ^"^C-la Mediated by (5)M2+TE. Holo- 
ACF4(4) (20 (xM) was incubated vwth 1 mM [l-^^CJ-labeled 2a (custom synthesized 
by Amershani Pharmacia, specific activity 55 mCi/mmol) and 1 |j,M (5)M2+TE in 
reaction buffer C for 10 min at 30 °C. The protein was precipitated with 75% 
acetone/HaO for 5 min at -80 °C. After washing the pellet with 6.25% (w/v) TCA to 
remove excess salts followed by 500 of 75% acetone/H20 to remove residual, 
unbound 14 C-2a, the precipitated protein was resuspended in 8 of buffer A and 4 
\iL of SDS sample buffer, and resolved on a 4-20% SDS-PAGE gradient gel (Bio- 
Rad). The proteins were visualized with Coomassie blue stain and dried, and the 
radioactivity was detected either on a Packard Instantlmager or by exposing the gel to 
X-ray film. 

[00180] Construction and Expression of Bimodular Enzymes. Analogous to 
DEBSl+TE described earlier, Cortes, et al.. Science 1995, 268, 1487-1489; Kao, et 
al., J. Am. Chem. Soc. 1995, 111, 9105-9106, M1+M5+TE (module 1 + module 5 + 
TE) and M1+M6+TE are heterologous fiisions of DEBS module 1 with DEBS 
modules 5 and 6, respectively. The natural linker between modules 1 and 2 in the 
wild-type DEBSl protein was preserved in each case. In addition, the DEBS 
thioesterase (TE) domain was fused to the C termini of each downstream module to 
facilitate turnover by catalyzing the release of the triketide product. These two 
proteins were expressed as C-terminally Hise-tagged proteins and pvirified on a 
hydrophobic butyl sepharose column followed by a Resource Q ion-exchange 
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chromatography to yield approximately 0.2 mg/L ciilture of purified M1+M5+TE and 
1 mg/L culture of purified M1+M6+TE. 

[00181] Kinetic Analysis of Bimodular Constructs. Li earlier studies on the 
kinetic properties of individual modules (Wu, et al., Am. Chem. Soc. 2000, 122, 4847- 
4852), substrates were diffusively presented to the KS domain of each module as free 
A^-acetylcysteamine (NAC) thioesters. This can be contrasted with the natural mode 
of chain transfer in a multimodular system, where acyl chains arrive at the KS domain 
via direct transfer from an upstream ACP domain (Figure 4). To explore whether the 
latter mode of substrate incorporation can have kinetic benefits over the former, the 
rates of triketide lactone 3a synthesis by M1+M5+TE and MI+M6+TE were 
measured in the presence of sa!turating concentrations of propionyl-CoA, 
methylmalonyl-CoA, and NADPH. The kcat values for these two hybrid PKSs were 
determined to be 3.1 ± 0.1 and 4.1 ± 0.4 min"', respectively (Figure 12). These 
parameters compare well with the maximal rate of 4.8 min"' for DEBSl+TE (Pieper, 
et al., Biochemistry 1997, 36, 1846-1851). In contrast, we have shown earlier that 
whereas module 2+TE and module 6+TE turn 2a over with comparable rate constants 
(kcat = 4.6 and 17 min'', respectively), module 5+TE is a significantly weaker catalyst 
for the same reaction (kcat = 0.25 min"'). Addition of exogenous 2a to the reaction 
catalyzed by the bimodular proteins had no effect on their overall catalytic rates. The 
implications of these results to intramodular substrate channeling will be evaluated in 
the discussion section. 

[00182] Construction and Expression of Individual ACPs. ACP4-(4) includes 
the entire DEBS ACP4 catalytic domain with its natural C-terminal linker. (The ACP 
linker is defined as the residues between the ACP consensus sequence and the C 
terminus of the polypeptide. See Tsuji, et al.. Biochemistry 2001). This gene was 
expressed as a 20.5 kDa N-terminally Hisg-tagged protein to preserve the natural 
sequence of the C-terminal linker. ACP4(4) was purified by affinity chromatography 
on a nickel column followed by a hydrophobic phenyl sepharose column to yield 
approximately 10-15 mg/L culture of purified apoprotein. 
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[00183] Chemoenzymatic Synthesis of Acyl-ACPs. Preparations of the CoA 
thioesters of the natural diketide substrate of module 2, its enantiomer, its C-3 epimer, 
and its C-2 epimer (Figure! 1) were carried out as described in the Materials and 
Methods section. Phosphopantetheinylation of apo-ACP4(4) was cata-lyzed by Sfp 
(Quadri, et al.. Biochemistry 1998, 37, 1585-1595; Weinreb, et al.. Biochemistry 1998, 
57, 1575-1584), (Figure IOC, first reaction) to generate acyl-ACP4(4) adducts 4a-d. 
Purification of these acyl-ACPs, as described in the Materials and Methods section, 
led to >95% pure materials as judged from SDS-PAGE. Complete phospho- 
pantetheinlyation was verified by MALDl-MS or ESl-MS. 

[00184] Qualitative Assays of Diketide Incorporation by Acyl-ACPs. The acyl- 
ACP4(4) adducts 4a-d were incubated individually with (5)M2+TE, (5)M5+TE, and 
(5)M6+TE in the presence of saturating concentrations of ''^C-methylmalonyl CoA 
extender unit and NADPH. For a given acyl-ACP, the products from modules 2+TE, 
5+TE, and 6+TE were expected to be identical (Figure 11), since the modules 
catalyze the same set of reactions with identical stereocontrol (albeit normally on very 
different natural substrates). Both 4a and 4b were accepted and extended by all three 
modules. Likewise, the corresponding NAC-thioesters 2a and 2b have been shown to 
be substrates for the three modules.^^ Remarkably, however, 4c and 4d were also 
substrates for the three modules, even though no turnover of the corresponding NAC 
thioesters 2c and 2d was detected in the case of any module, Wu, et al.. Am. Chem. 
Soc. 2000, 122, 4847-4852 (It should be noted that elongation of 4c and 4d by 
modules 5+TE and 6+TE yielded minor quantities of unreduced triketide lactones, 
indicating less efficient p-ketoreductase activity on these two anti-diketide substrates 
than on the two syn-diketide substrates.). Consistent with previous linker studies,^^'^^ 
while all four acyl-ACP adducts were observed to be substrates for (5)M3+TE, no 
product formation was observed from the incubation of any of the acyl-ACP adducts 
with (3)M3+TE, even though 2a and 2b have previously been shown to be readily 
incorporated and elongated when presented to either module 3+TE derivative, Wu, et 
al., Am. Chem. Soc. 2000, 122, 4847-4852; Tsuji, et al.. Biochemistry 2001. Thus, 
matched linker pairs appear to be capable of enhancmg the efficiency with which 
otherwise poor substrates can be channeled between modules. Conversely, 
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mismatched linkers can present a major barrier to the channeling of otherwise 
acceptable substrates between modules. Control studies performed with 2a and 5a 
showed that the two compounds are approximately equivalent substrates for the same 
modules (data not shown). From the amount of product detected in these ACP- 
mediated reactions, the efficiency of the PKS-catalyzed reaction could be estunated. 
Under typical assay conditions (20mM 4 and IpM (5)M5+TE), 70% of the acyl-ACP 
was converted into triketide lactone ui 1 h. Two conclusions can be drawn ftom this 
result. First, acyl-ACPs are significantly superior substrates to acyl-NAC thioesters. 
(Typically, millimolar concentrations of the NAC thioester must be used to detect 
comparable amounts of product under otherwise similar assay conditions.) Second, 
the assay system described in Figure 1 1 allows for monitoring multiple turnovers of 
the enzyme. Indeed, as described below, in all cases the maximum rates of 
consumption of the acyl-ACP substrates were comparable to or higher than the 
maximum rates of consumption of tiiekNAC thioester counterparts (see below). 
Therefore, the association of the donor ACP and the acceptor module must be 
transient, and the dissociation rate constant of the ACP from the module must be 
significantly faster than the slowest step in the module-catalyzed elongation sequence. 

[00185] Kinetic Analysis of Incorporation of Diketides from Acyl-ACPs. The 

kcat/Kw values for the reactions of 4a and 4b with (5)M2+TE, (5)M5+TE, and 
(5)M6+TE are shown in Figure 13. Since full saturation curves could not be obtained 
for the ACP-bound substrates for technical reasons, the kcat/Kw value for 4b was 
derived by competitive assay against 2a (whose absolute KJKu value was derived 
from the initial slope of the v versus [S] saturation curve). Likewise, the k^JKu value 

; for 4a was derived by competitive assay against 4b. Several observations are 

noteworthy regarding the data summarized in Figure 13. First, 2a and 4a are 
significantly better substrates than 2b and 4b for each module. In addition, the 
improvement m specificity for an acyl-ACP adduct over its NAC thioester counterpart 
is particularly pronounced in cases where the NAC thioesters are exceptionally poor 

) (e.g., module 2+TE-catalyzed elongation of 4b versus 2b, module 5+TE-catalyzed 

elongation of 4a versus 2a, and especially module 5+TE-catalyzed elongation of 4b 
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versus 2b). The implications of these observations are elaborated in the Discussion 
section. 

[00186] To quantify the kinetic advantage of channeling in the above assay system, 
the kcat values for the reactions of 4a-d with modules 2+TE, 5+TE, and 6+TE were 
measured (Figure 14). Kinetic measurements were performed at substrate 
concentrations between 40 and 90|xM of each acyl-ACP substrate. For reactions that 
yielded measurable quantities of unreduced triketide lactone products (i.e., elongation 
of 4c and 4d by modules 5+TE and 6+TE), both reduced and unreduced products 
were combined and included for calculations of the kinetics parameters. Except for 
the reaction of 4a with module 2+TE, none of the substrates saturate the enzymes in 
this concentration range, and thus, the kcat values are reported as lower bounds. 
Comparison of the kcat values of the acyl-ACP forms of the four diastereomeric 
diketides with those ftom the corresponding reactions involving NAC thioesters 
substrates (which are also shown in Figure 14) (Wu, et al., Am. Chem. Soc. 2000, 122, 
4847-4852) affords two interesting observations. First, in those cases where the NAC 
thioester is a reasonably good substrate (e.g., module 2+TE- or module 6+TE- 
catalyzed elongation of 2a and 4a), the maximal reaction rates are comparable 
regardless of whether the acyl chain is bound to NAC or an ACP with a matched 
linker. In contrast, where the NAC thioester is an inferior substrate (e.g., module 
2+TE-catalyzed elongation of 2b and 4b,M5+TE-catalyzed elongation of 2a and 4a, 
module 5+TE-catalyzed elongation of 2b and 4b, and elongation of 4c and 4d by any 
of the modules), tethering the same acyl chain to an ACP with a matched linker can 
result in significant improvements in kcat- Second, the maximal rate of tumover of 2a 
and 4a is significantly greater than that of 2b and 4b for all tested modules. Again, 
the implications of these fmdings are discussed below. 

[00187] Investigation of the Reversibility of the Donor ACP to Acceptor KS 
Transfer Reaction. Ordinarily, the flow of intermediates in a metabolically active 
PKS is vectorial. A possible mechanism for such directionality could be that, once an 

acceptor KS is acylated with the incoming chain, conformational changes in the 
module prevent the pantetheine arm of the donor ACP fi-om accessing the active site 
again. To test whether this may be the case, holo-ACP4(4) was incubated with ^'*C-2a 
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in the presence and absence of (5)M2+TE (Figure 15). The ACP was radiolabeled 
only in the reaction containing (5)-M2+TE. (As expected, (5)M2+TE was also 
labeled.) When methylmalonyl CoA extender units and NADPH were added to induce 
catalytic activity of the module, (5)M2+TE dependent labeling of ACP4(4) was also 
observed, but the degree of labeling was considerably reduced. Thus, there does not 
appear to be an absolute barrier to the back-transfer of an acyl chain from a KS to the 
ACP of the preceding module. 

[00188] We have previously investigated the substrate specificity of individual 
modules of DEBS using diketide substrates activated as N-acetylcysteamine (NAC) 
thioesters (Figure lOA) (id.). The diketides included in the previous study were the 
four diastereomeric forms of the natural substrate for module 2 (Figure 1 1). These 
substrates were assayed against DEBS modules 2, 3, 5, and 6~each with a TE domain 
ftised to the C terminus to facilitate tumover-and steady-state kinetic parameters 
were determined for each substrate-enzyme combination. The substrate specificity 
profiles (as reflected in the kcat/KM values) for the four enzymes were found to be 
remarkably similar in that all of the modules preferred 2a over 2b, and, within 
detection limits, neither of the two anti-diketides 2c or 2d was observed to be a 
substrate for any of the modules. 

[00189] The preference of 2a over its enantiomer 2b for all modules was especially 
intriguing in light of the fact that the natural substrates for modules 3 and 6 share 
more structural similarities to 2b than to 2a. One explanation for this discrepancy 
was that the NAC thioester-based assay system (Figure lOA) may not entirely 
represent the mechanism of acylation of a multimodular system. While NAC- 
thioesters substrates must be loaded diffusively onto the KS of a module (Figure 
lOA), polyketide intermediates are chaimeled from the ACP of one module to the KS 
of the downstream module via covalent transfer. Substrate channeling in a 
multimodular system can occur either between two modules on the same polj^eptide 
(e.g., between modules 1 and 2; Figure lOB), or between two modules on separate 
polypeptides (e.g., between modules 2 and 3; Figure IOC). Further evidence that 
protein-protein interactions may influence the substrate specificities of individual 
modules emerged from previous experiments suggesting, that inter-polypeptide 
linkers—defined as the highly variable regions outside the consensus sequences of the 
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modules-are involved in mediating selective intermodular chain transfer, Gokhale, et 
al.. Science 1999, 254, 482-485; Tsuji, et al.. Biochemistry 2001. To investigate the 
balance of protein-protein interactions and enzyme-substrate interactions in 
controlling polyketide chain elongation, two assay systems that take intermodular 
interactions into account were used in this study. 

[00190] Kinetic Channeling in Intrapolypeptide Chain Transfer. In the first 

system (Figure lOB), the effect of substrate channeling between two modules within 
the same polypeptide was investigated. More specifically, in the context of the 
bimodular constructs M1+M2+TE (DEBSl+TE), M1+M5+TE, and M1+M6+TE, 
DEBS modules 2, 5, and 6 were examined for their abilities to accept and elongate the 
natural diketide intermediate that was passed ftom a covalently attached module 1 . 
The turnover number of M1+M5+TE was comparable to that of M1+M6+TE or the 
"wild-type" M1+M2+TE. In contrast, when primed diffusively by 2a, the maximum 
catalytic rate of module 5+TE is significantly reduced compared to that of module 
2+TE or module 6+TE. This disparity indicates that covalent connection of modules 
can have a beneficial kinetic effect and hinted that, in addition to physical channeling 
of intermediates, multunodular PKSs are also capable of kinetic chaimeling of 
intermediates. However, since the only incoming substrate that could be tested using 
this assay system was the natural diketide, a better assay system was needed to 
explore the role of kinetic channeling more generally. 

[00191] Kinetic Channeling in Interpolypeptide Chain Transfer. The minimal 
donor protein requirement for substrate channeling to an acceptor module was 
postulated to be an ACP domain with an appropriate C-terminal linker. Therefore, we 
constructed, expressed, and purified the ACP4 domain and its natural C-terminal 
linker as an individual polypeptide. A variety of acyl groups were then covalently 
attached to the phosphopantetheine arm of holo-ACP4(4) via a chemoenzymatic 
procedure (Figure IOC, first reaction). The resulting acyl-ACP4(4) adducts 4a-d were 
tested for their ability to transfer the attached diketides fi-om ACP4(4) to the KS of an 
acceptor module (Figure IOC, second reaction), where they could then undergo 
standard chain elongation to yield a triketide lactone (Figure IOC, third reaction). The 
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small size of the ACP4(4) protein, together with high expression levels of soluble 
protein in Escherichia coli, allowed production of reagent quantities of this protein for 
use as a substrate in multiple turnover assays. The requirement of both wa ACP and 
the linker was highlighted by the fact that corresponding CoA thioesters exhibited 
comparable kinetic parameters and that mismatched linkers led to a dramatic 
reduction in turnover efficiency in the case of module 3+TE. The latter feature is 
consistent with the linker hypothesis developed earlier. Id. Although the precise Km 
values for individual acyl-ACP substrates could not be measured in many cases, in all 
cases they were estimated to be approximately 2-3 orders of magnitude lower than the 
reported Km values for the corresponding NAC thioester reactions (micromolar for 
acyl-ACPs versus millimolar for acyl-S-NACs), thus making acyl-ACPs excellent 
substrates for individual PKS modules. 

[00192] Implications of the Interpolypeptide Transfer Kinetics Data. The 

establishment of the acyl-ACP-based assay system allowed us to address two 
important questions regarding the relative balance of protein-protein interactions and 
enzyme-substrate interactions in multimodular systems. First, is the universal 
preference among the three tested modules for 2a over 2b preserved when the same 

substrates are delivered as acyl-ACP adducts? And second, under saturation 
conditions, can kinetic channeling of these diketide substrates be observed for any 
module? 

[00193] As seen in Figure 13, the preference for the (2iS',3i?)-diastereomer over its 
enantiomeric (2i?,35)-diastereomer by all modules is preserved regardless of whether 
the substrate is loaded by a channeling mechanism or by a diffusive mechanism. This 
conserved preference suggests that the catalytic steps in a given module that 
discriminate between different substrates remain unchanged whether modules are 
primed diffusively or by a channeling mechanism and that for module 2, at least, the 
most likely source of discrimination is the KS acylation step. Furthermore, the 
turnover numbers of the individual enzjmie-catalyzed reactions reported in Figure 14 
make a strong case for kinetic chaimeling in multimodular PKSs. When saturating 
concentrations of 2b are co-incubated with M5+TE in the presence of methylmalonyl- 
CoA and NADPH, the elongation rate constant is only 0.02 min"^ hi contrast, when 
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the same reaction is monitored using 4b as a substrate, the elongation rate constant 
(kcat) increases at least 25-fold. Similarly, a maximal rate increase of greater than 40- 
fold is observed in the elongation of 4a versus 2a by M5+TE. The effect may be even 
more pronounced for the two anti-diketides, whose elongation rates are below 
detectable limits (<0.01 min') when presented as NAC thioesters 2c and 2d, but are 
quite respectable ( ~ 1 min') when presented as acyl-ACP adducts 4c and 4d. Of 
course, if the KM values for the two anti-diketides when presented as NAC-thioesters 
are significantly higher than the sohibility-limited concentrations that were used in the 
assay, then the apparent kinetic advantage of channeling the anti-diketides could be 
artificially high. Even so, these overall results indicate that channeling dramatically 
increases the efficacy of poor substrates. In addition, the results reported here provide 
insight into how multimodular PKSs can be so remarkably tolerant toward protein 
engineering, even though individual modules are fairly specific catalysts. 

[00194] The ReversibiBty of AC/Vi to KSn+1 Transfers. Fmally, the ACP- 
mediated strategy for diketide loading onto acceptor modules also enabled us to 
address the question of reversibility of the transacylation reaction between the donor 
AC? and the recipient KS. While co-incubation of "*C-labeled 2a with holo-ACP4(4) 
afforded essentially no labeling of the ACP, co-incubation of '"^C-labeled 2a with 
holo-ACP4(4) in the presence of (5)M2+TE gave both labeled (5)M2+TE and 
ACP4(4) (Figure 15 A). The proposed mechanism for the observed labeling is shown 
in Figure 15B and requires back-transfer of the acyl group from the KS to the ACP of 
the upstream module. Back-transfer was also observed under turnover conditions, 
albeit at a substantially reduced level. Thus, the observed directionality of chain 
transfer in the context of a multimodular PKS that is rapidly turning over appears to 
arise due to kinetic channeling of these intermediates rather than a ratchet mechanism 
that explicitly precludes back-transfer. However, given the 20-fold excess of ACP to 
module, the occupancy level of the diketide on the ACP is quite low. Consequently, 
in a PKS where two modules on separate polypeptides exist in approximately 
equimolar ratios, reverse transfer from a downstream module to an upstream modide 
occurs rarely and only at steps where a significant barrier for forward chain transfer is 
encountered. In contrast, for intermodular chain transfer between modules within the 
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same polypeptide, the effective molarity of the donor ACP group is significantly 
higher, and reversible transfer may be more significant (but without chemical 
consequence). The likelihood of intrapolypeptide reverse transfer may explain why 
the loading didomain shows relatively low selectivity for a propionyl starter unit 
versus an acetyl starter unit (2:1), see Lau, et al.. Biochemistry 2000, 39, 10514- 
10520, whereas DEBSl+TE discriminates strongly (32:1) between the two starter 
units. See Pieper, et al., Biochemistry 1996, 55, 2054-2060. 

[00195] These studies represent the first direct observation of kinetic channeling of 
intermediates in a modular PKS. Several dramatic examples are presented for both 
intrapolypeptide transfers and interpolypeptide transfers where the maximal rate 
constant (kcat) for elongating a particular ketide substrate by a DEBS module increases 
10- to > 100-fold when the substrate is channeled relative to when it is diffiisively 
presented. Linkers are shown to play an important role in kinetic channeling, 
although the contribution of other elements, such as the pantetheine arm or protein- 
protein interactions between the donor and recipient modules, cannot be excluded. In 
addition, our studies have also reinforced the fact that, while individual modules are 
tolerant of stereochemical diversity in diketides, they are at the same time fairly 
specific catalysts. In addition, their specificities and recognition features do not 
necessarily correlate with the structures of their natural substrates. Finally, we have 
shown that the transfer step from a donor ACP to an acceptor KS is a fundamentally 
reversible reaction. Structural and more detailed mechanistic studies on these 
remarkable multifimctional catalysts should be particularly interesting from the 
viewpoint of understanding the atomic basis for the phenomena described here. 

Methods Directed Towards Assessing the Effects of the Interactions between the ACP 

Domain and the KS Domain and Linker Interactions 
[00196] Construction of Plasmids. The construction of genes encoding 
(5)M2+TE, (3)M3+TE, (5)M5+TE, and (5)M6+TE (pRSG64, pRSG34, pRSG46, and 
pRSG54, respectively) (Gokhale, et al., (1999) Science 284, 482-5); (5)M3+TE 
(pST132) (Tsuji, et al., (2001) Biochemistry 40, 2317-2325); ACP4(4) (pNW8) (Wu, 
et al., (2001) J. Am. Chem. Soc. 123, 6465-6474); eryLDD (pJL636) (Lau, et al., 
(2000) Biochemistry 39, 10514-10520); andNovH(0) (Chen, et al., (2001) Chem Biol 
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74, 1-12) have been previously described. (3)M5+TE encodes a derivative of DEBS 
module 5 in which its natural N-terminal linker has been replaced with the N-terminal 
linker fiom module 3. The N-terminal linker of module 3 was excised from pRSG34 
(Gokhale, at al., (1999) Science 284, 482-5) (which encodes (3)M3+TE) as an Ndel- 
BsaBl fragment. The resulting fragment was used to replace the corresponding Ndel- 
BsaBI fragment in pRSG45, which encodes (5)M5+TE, (id) to yield pST133. 
ACP2(2) encodes the ACP domain of DEBS module 2 through its natural stop codon. 
This sequence was extracted from the gene cluster as an Ndel-EcoRl fragment by 
PGR using the following primers: 

5'-C CAT ATG CTG CGC GAG CGG CTG-3' (SEQ ID N0:8), 
5'-GAA TTC TCA ATG GCC GTC GAG CTC C-3' (SEQ ID 

N0:9). 

[00197] ACP2(4) encodes the ACP domain of DEBS module 2 with its natural C- 
terminal linker replaced with the corresponding linker from module 4 using an 
engineered Spel site at the junction. The ACP domain was obtained as an NdelSpel 
fragment by PGR using the following primers: ' 

5'-C CAT ATG GTG GTC GAG CGG CTC G-3' (SEQ ID NO:10), 
5'-ACT AGT GAG GAA ACC GGC GAG CG-3' (SEQ ID NO:l 1) 
(sequences complementary to DEBS shown in bold). Generation of the C-terminal 
linker region as an Spel-EcoBl fragment by PGR has been previously described 
(Tsuji, et al., (2001) Biochemistry 40, 2317-2325). These two fragments were cloned 
into pET28a to give pNW19. ACP2(0) and ACP4(0) encode the ACP domain of 
DEBS module 2 and module 4, respectively, with stop codons engineered at the end 
of the regions of homology. The coding regions were obtained as Ndel-EcoRL 
fragments by PGR using the following primers: 

5'-C CAT ATG CTG CGC GAC CGG CTG-3' (SEQ ID N0:12), 
5'-GAA TTC TTA GCC GAG CTC GGC GTC-3' (SEQ ID N0:13) 
for ACP2(0) and primers: 

5'-C CAT ATG GTG GTC GAC CGG CTC G-3' (SEQ ID 

N0:14), 

5'-GAA TTC TTA GAA CAG CCT GTC CGG CAG-3' 

(SEQ ID NO: 15) 
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for ACP4(0). The PGR products were cloned into pET28a to afford pNW6 
(ACP2(2)), pNW7 (ACP2(0)) and pNW9 (ACP4(0)). NovH(4) encodes the 
adenylation (A) and peptidyl carrier protein (PCP) domains of the NovH open reading 
frame (ORF) from the novobiocin pathway (Chen, et al., (2001) Chem Biol 74, 1-12). 
It was fused to the C-terminal linker of module 4 of DEBS as follows. DNA 
encoding NovH was derived from pHClO {id.) as an Ndel-Xhol fragment. The linker 
region was obtained as snXhol-Bpul 1021 fragment using the following primers: 
5'-CTG CTC GAG AGG CTG TTC GCG GCC TCA-3' 

(SEQ ID 

N0:16) 

5'-C CCG CTG AGC CTA CAG GTC CTC TCC CC-3' 

(SEQ ID 
NO: 17). 

These two fragments were cloned into pET28a to yield pNW35. 
[00198] Expression and purification of individual modules. All previously 
characterized single modules were expressed and purified as preAdously described 
(Wu, et al., (2000) J. Am. Chem. Soc. 122, 4847-4852; Tsuji, et al., (2001) 
Biochemistry 40, 2317-2325). (3)M5+TE (pST133) was expressed using a slightly 
modified version of the protocol used for previously characterized individual modules 
{id.). This protein was expressed in E. coli BAPl (Pfeifer, et al., (2001) Science 291, 
1790-1792) in which the sfp phosphopantetheinyl transferase gene from Bacillus 
subtilis (Lambalot, et al., (1996) Chem Biol 3, 923-36) has been inserted into the 
chromosome. BAPl/pST133 cells were grown at 37°C in LB media witii 100 mg/L 
of carbenicillin to an ODeoo = 0.5, at which pomt they were cooled to 22°C in a water 
batii and then induced with 0.7 mM IPTG for 12 hours. The cells were harvested by 
centrifugation, washed with 50 mM Tris/1 mM EDTA (pH 8), and then resuspended 
in disruption buffer (100 mM NaH2P04 (pH 7.2), 100 mM NaCl, 1.2 mM DTT, 1 .2 
mM EDTA, 0.7 mM benzamidine, 1 mg/L pepstatm, 1 mg/mL leupeptin, and 15% 
glycerol) before lysis by French Press (2x). After the cell debris was removed by 
centrifugation, the supernatant was treated with a 0.1% PEI precipitation followed by 
a 60% (NH4)2S04 precipitation for 2 hours. The resulting (NH4)2S04 pellet was 
resuspended in buffer A (see Reagents and Chemicals section above for composition), 
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flash frozen in liquid nitrogen, and stored at -80°C until ready for further purification. 
The crude protein was purified by FPLC on a hydrophobic butyl sepharose column 
followed by a Resource Q anion exchange column as previously described (Wu, et al., 
(2000) J. Am. Chem. Soc. 122, 4847-4852; Tsuji, et al., (2001) Biochemistry 40, 2317- 
2325) to yield 10 mg/L culture of purified (3)1V[5+TE. 

[00199] Expression and purification of ACP and PCP proteins. Apo-ACP4(4) 
and apo-NovH(0) were expressed in the E. coli strain BL21(DE3) and purified as 
previously described (Wu, et al., (2000) J. Am. Chem. Soc. 122, 4847-4852; Chen, et 
al., (2001) Chem Biol 74, 1-12). Apo-ACP2(2), apo-ACP2(0), apo-ACP4(0), apo- 
ACP2(4), and apo-NovH(4) were obtained by overexpression of pNW6, pNW7, 
pNW9 and pNW19, respectively, in the E. coli strain BL21(DE3). After growth in 
LB (50 mg/L kanamycin) at 37°C to ODgoo = 0.5-0.7, the cells were cooled in a water 
bath to 22°C and then induced with 1 mM IPTG for 12 hours at 22°C. The cells were 
then harvested by centrifugation, washed with 50 mM Tris (pH 8), and then 
resuspended in buffer B before lysis by French Press (2x). The cell debris was 
cleared by centrifugation and the supernatant batch loaded onto Ni NTA-agarose 
(Qiagen) resin (4 mL/L culture) for 1 hour. The resin was loaded into a Flex-column 
(Kontes), washed with 10 column volumes of 35 mM imidazole in buffer B (see 
Reagents and Chemicals section above for composition), and then the desired N- 
terminal Hise-tagged proteins were eluted with 100 mM imizadole in buffer B. The 
appropriate fractions were concentrated and the buffers were exchanged to buffer A 
(see Reagents and Chemicals section above for composition) + 1.5 M (NH4)2S04 in 
Centriprep spin columns (Amicon). Using an Akta FLPC system (Amersham 
Pharmacia Biotech AB), the concentrated protein was loaded at 1 mL/min onto a XK 
16/20 column packed with 30 mL Phenyl Sepharose High Performance resin and 
equilibrated with the same buffer. A gradient firom 1 M (NH4)2S04 to 0 M CNH4)2S04 
in buffer A was applied, resulting in the elution of the desired proteins between 1 50 
mM and 0 mM (NH4)2S04. The appropriate fractions were concentrated and buffer 
exchanged to buffer A in Centriprep spin columns to yield approximately 6 mg/L of 
ACP2(2), 15 mg/L culture of ACP2(4), 5 mg/L culture of purified ACP2(0), and 3 
mg/L culture of ACP4(0). These purified proteins were then flash frozen in liquid 
nitrogen and stored at -80°C. Expression and purification of apo-NovH(4) were 



69 



wo 03/076581 



PCT/US03/06910 



perfonned under the same condition as described for the ACP proteins, except 
expression was induced with 0.1 mM IPTG at 15°C. These conditions yielded 25 
mg/L culture of purified NovH(4) The masses of these proteins were confirmed by 
ESI-MS or MALDI-MS. The parent masses of the proteins were found in all cases. 
Mass peaks 178 daltons less than the parent masses were found in some cases, 
corresponding to loss of N-terminal N-fonnylmethionines. The apo-ACP2(0): 
observed mass = 12073 (parent mass) and 1 1895 (mass - formylmethionine), 
calculated mass = 12027. apo-ACP4(0): observed mass =11917 (parent mass), 
calculated mass = 1 1901. apo-ACP2(2): observed mass = 20532 (parent mass) and 
20354 (mass - formylmethionine), calculated mass = 20495. apo-ACP2(4): observed 
mass = 20635 (parent mass) and 20457 (mass - formylmethionine), calculated mass = 
20661. apo-NovH(4): observed mass = 74502 (parent mass) and 74323 (mass - 
formylmethionine), calculated mass = 74626. 

[00200] Chemoenzymatic synthesis of diketide-ACP and diketide-PCP 
substrates. The apo-PCP and apo-ACP proteins were converted to their respective 
diketide-ACP forms as previously described and as shown in figure 24 A (Wu, et al., 
(2001) J. Am. Chem. Soc. 123, 6465-6474). Briefly, phosphopantetheinylation of 
each active site serine residue was catalyzed by sfp in the presence of 2, which was 
synthesized as previously described {id.). The diketide-ACP/PCP substrates were 
either immediately used in Hie module substrate incorporation assays or used after 
purification by ion exchange chromatography. Purified protein concentrations were 
determined by Lowry assay. The masses as well as complete 
phosphopantetheinylation were confirmed by ESI-MS or MALDI-MS. A SDS-PAGE 
gel of the purified proteins is shown in figure 25. Representative mass spectral datum 
are shown in figure 26 C, illustrating the purity and the complete conversion firom the 
apo-ACPs. The parent masses of the proteins were foimd in all cases. Mass peaks 
178 daltons less than the parent masses were found in some cases, corresponding to 
loss of N-terminal N-formylmethionines. All observed parent masses are within the 
1% error range that is expected from the spectrometers. Diketide-ACP2(0): observed 
mass = 12528 (parent mass) and 12350 (mass - formylmethionine), calculated mass = 
12498. Diketide-ACP4(0): observed mass = 12372 (parent mass), calculated mass = 
12353. Diketide-ACP2(2): observed mass = 20809 (parent mass) and 20988 (parent 
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mass - fomylmethionine), calculated mass = 20947. Dilcetide-ACP2(4): observed 
mass = 21089 (parent mass) and 20910 (parent mass - formylmethionine), calculated 
mass = 21113. Diketide-NovH(4): observed mass = 74783 (parent mass), calculated 
mass = 75078. 

[00201] Substrate transfer and elongation assays. Qualitative assays were 
performed with a diketide-ACP or diketide-PCP substrate either taken directly from 
the sfy) phosphopantetheinlyation reaction or after fiirther purification of the substrate. 

These assays were performed with 20 fiM diketide-ACP/PCP substrate for 2 hours in 
the following reaction conditions: 1 (xM acceptor module, 0.5 mM '*C-methylmalonyl 
CoA, 4 mM NADPH in buffer C, 30°C. After quenching by addition of 250 ^iL 
EtOAc and vortexmg, the products were extracted with 2 x 250 |xL EtOAc, resolved 
on a silica gel TLC plate, and visualized on a Packard Instantlmager. A 
representative TLC plate image is shown in figure 30 B. 

Example 8 
Protein Preparations 

[002021 ACP4(4) (i.e., DEBS ACP4 with its natural C-terminal linker) {id. ) and 
eryLDD(0) (i.e., the DEBS loading didomain with no C-terminal lii±er) (Lau, et al, 
(2000) Biochemistry 39, 10514-10520) were constructed and expressed as previously 
described. ACP2(2) includes the DEBS ACP2 domain and its natural C-terminal 
linker. The linker is defined as the sequence from the end of the ACP consensus 
sequence to tihe natural stop codon (Tsuji, et al., (2001) Biochemistry 40, 23 17-2325.). 
ACP2(4) was constructed as a fusion protein between ACP2 and the C-terminal luiker 
of ACP4. ACP2(0) and ACP4(0) are isolated ACP domains without linker regions. 
All proteins were expressed as N-terminally Hise-tagged apo proteins that could 
subsequently be purified by Ni-affinity chromatography to yield 6 mg/L culture of 
ACP2(2), 15 mg/L culture of ACP2(4), 5 mg/L culture of purified ACP2(0), 3 mg/L 
culture of ACP4(0), and 25 mg/L culture of NovH(4). These proteins were converted 
to diketide-ACPs and diketide-PCP substrates by phosphopantetheinylation with sfp 
in the presence of 2, as previously described (Wu, et al., (2001) J. Am. Chem. Soc. 
123, 6465-6474). An SDS-PAGE gel of the purified protem substrates is shown in 
figure 25. In addition, representative mass spectral datum of diketide-ACP2(2) and 
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diketide-ACP2(4) are shoAvn in figure 26 C to demonstrate quantitative 
phosphopantetheinlyation by sjjp. 

[00203] (5)M2+TE, (3)M3+TE, (5)M3+TE, (5)M5+TE, and (5)M6+TE were 
constructed and expressed as previously described (Wu, et al., (2000) J. Am. Chem. 
Soc. 122, 4847-4852; Tsuji, et al., (2001) Biochemistry 40, 2317-2325). pST133 
encodes (3)M5+TE which is a fusion protein of module 5 covalently attached to the 
thioesterase domain to facilitate turnover. In addition, the natural N-terminal linker of 
module 5 is replaced with the N-terminal linker of module 3. Expression and 
purification of this protein was carried out according to the previously reported 
protocol (Wu, et al., (2000) J. Am. Chem. Soc. 122, 4847-4852). 

Example 9 

Analysis of the Modularity of Linker Regions 
[00204] The linker regions have previously been suggested to be modular, or 
fimctionally independent (Gokhale, et al., (1999) Science 284, 482-5; Tsuji, et al., 
(2001) Biochemistry 40, 23 1 7-2325). The kinetics of substrate transfer at the module 
2-module 3 interface followed by elongation and product release were examined as a 
function of the keoiM and kca/KM values of the overall reaction. The k6o^M values 
reported here represent the apparent overall rate of product formation at an initial 
substrate concentration of 60 ]iM. In many cases, the Argo^A/ values approximate the 
maximal overall turnover rates, as determined by back-calculating the iC^f value for 
the reactions. True saturation kinetics were not practical because of the technical 
limitations (e.g., solubility) and limited supply associated with high molecular weight 
substrates such as diketide-ACP and diketide-PCP. ha/KM values were determined 
by competitive assay of the substrate of interest against a substrate with a known 
kca/Kuyalue, as previously described (Wu, et al., (2001) J. Am. Chem. Soc. 123, 
6465-6474). This method for determining ^ca/ST^ values was chosen because it 
allowed us to conserve our limited supply of protein-based substrates compared with a 
durect measurement of the initial slope of a full v vs. [S] plot. A representative time 
course and liquid schitillation coimting data used to determine ^^ohM values and 
^ca/^M values are shown in figures 28 E-F, respectively. 

[00205] These reactions were quenched by the addition of 80 jjL 12.5% SDS to 2 
|j,L reaction mixture and immediate vortexing. The products were then extracted from 
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the aqueous phase with 2x 250 \iL EtOAc. After removing the organic solvents in 
vacuo, the residual products were then spotted onto a TLC plate (Baker-flex 250 uM 
silica gel), resolved in 60% EtO Ac/40% hexanes, and the radioactive spots were 
visualized and quantified on a Packard Instantlmager. In the first reaction, shown in 
figure 26 A, diketide-ACP2 and module 3 with their natural linker regions manifest 
kgOixM and kaa/KM values of 1 .4 min"' and 390 mm'^mM"*, respectively. When the 
module 4-module 5 linker pairs are transplanted into the module 2-module 3 interface 
as shown in figure 26 B, the kgOixM value remains approximately the same, but the 
kca/KM value decreases to 56 min''mM"\ This comparison suggests that swapping out 
natural linker pairs for alternative linker pairs affects the Km value of the transfer and 
elongation reaction, but not the maximum rate. 

Example 10 

Analysis of the Relative Contributions of the Donor AC?. Acceptor KS and Linkers 
to Chain Elongation 

[00206] Various donor ACP-acceptor module pairs were examined for their ability 
to transfer substrates from the donor ACPs to the acceptor modules, which could then 
elongate and release triketide lactone product. Two sets of reactions were carried out 
- one in which the acceptor module was DEBS module 3 and the other in which the 
acceptor module was DEBS module 5. For each set of reactions, reactions were 
performed representing one of the following conditions: A) matched linkers and 
matched donor ACP-acceptor KS pairs, B) mismatched linkers and matched ACP-KS 
paks, C) matched linkers and mismatched ACP-KS pairs, or D) mismatched linkers 
and mismatched ACP-KS pairs. As indicated by the formation of the expected 
triketide lactone product, transfer of diketide fiwm the donor ACP to the acceptor 
module occurred at 20 nM substrate concentration in the reactions shown in figures 
27 A-C and 28 A-C. These successful reactions represent conditions A-C (as defined 
above), and their kinetic parameters were further investigated. In contrast, no product 
was detected at the same substrate concentrations from the reactions in figures 27 D 
and 28 D (representing condition D), mdicating that transfer did not occur in the 
presence of both mismatched Imkers and ACP-KS pairs. This qualitative data 
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indicates the diketide substrate can be transferred to module 3 or 5 as long as either 
the linkers are matched or the ACP-KS pairs are matched. 

[00207] In order to quantify the relative contributions of the linker pairs versus the 
ACP-KS pairs to the efficient channeling of substrates, keoiiM and kca/Ku values were 
measured for the reactions shown in figures 27 A-C and 28 A-C. The reactions of 
diketide-ACP2(2) + (3)M3+TE (figure 27 A) and diketide-ACP4(4) + (5)M5+nE 
(figure 28 A) manifest ^(5o^Af values of 1.4 min'' and 9.3 min'' and Arca/Kji/ values of 
390 min"'mM"' and 290 min"'mM"', respectively. In contrast to these reactions 
comprising matched linkers and matched ACP-KS pairs, the reactions in which either 
the linkers are mismatched or the ACP-KS pairs are mismatched (but not both) 
manifest significant and similar decreases in catalytic efficiencies and specificities. 
While the keo^M A;ca/-KAf values for the mismatched reactions shown in figures 27 
B-C fell approximately 3-5 fold and 80-200 fold, respectively, the corresponding 
values for the mismatched reactions shown in figures 28 B-C fell approximately 20- 
fold and 150-fold, respectively. These data suggest that for both module 3 and 
module 5, the linker interactions and the donor ACP-acceptor KS interactions play 
significant and approximately equal roles in the chaimeling of substrates between 
modules. 

Example 11 

Analysis of Chain Elongation by Various Acceptor Modules in the Presence of a 
Linkerless ACP4 

[00208] Linker interactions were eliminated entirely firom the transfer and 
elongation assays in the reaction of linkerless diketide-ACP4(0) with (5)M2+TE, 
(5)M5+TE, (3)M5+TE, and (5)M6+TE. Formation of the expected triketide lactone 
was observed firom the reactions of diketide-ACP4(0) with (5)M5+TE and 
(3)M5+TE (figure 29 A-B), both of which contained matched ACP-KS pairs. The 
reaction shown in figure 29 A has ^^oi^and Atco/ATa/ values of 0.49 min' and 4.1 min" 
^mM"', respectively, and the reaction in figwe 29 B has corresponding values of 0.27 
min ' and 2.5 min ^inM"\ respectively. These values are comparable to those 
observed when the linkers are mismatched and the ACP-KS pairs are matched 
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(figures 27 B and 28 B), indicating that, in this case, the presence of mismatched 
linkers and the deletion of complete linker pairs are kinetically equivalent. 
[00209] ACP4(0) was not able to efficiently transfer substrates to module 3, 
regardless of which N-terminal linker was covalently fused to the module (figure 
29 C-D). This result was expected based on the above observation that chaimeling to 
module 3 is eliminated in the absence of both matched ACP-KS pairs and matched 
linker pairs. In contrast, transfer of diketide from ACP4(0) to modules 2 and 6 was 
observed (figures 29 E-F, respectively), despite the elimination of linker interactions 
and the non-consecutive ACP-KS pairs. By comparison to the kinetics parameters for 
tlie same reaction catalyzed by modules 2 and 6 in the presence of matched linkers 
(figure 29G-H) (Wu, et al., (2001) J. Am. Chem. Soc. 123, 6465-6474), we note that 
the ^60^ values drop approximately 10-fold and the Aco/Xii/ values drop 
approximately 70-300 fold when the linker interactions are eliminated. These data 
suggest that modules 2 and 6 are weakly, but demonstrably more tolerant to unnatural 
dohor proteins than modules 3 and 5. 

Example 12 

Tolerance of modules 2 and 6 for unnatural donor proteins 
[00210] ACP2(0), eryLDD(0), NovH(0), and NovH(4) were examined as 
potential donor proteins for the transfer of diketide to modules 2 and 6 (figure 30 A; a 
radio-TLC image is shown in figure 30 B). Reactions of these same ACPs and PCPs 
were also performed with (3)M3+TE, (5)M3+TE, (5)M5+TE, and (3)M5+TE. As 
predicted by previous experiments, substrate transfer from any of these linkerless 
donor proteins to module 3 or 5 was not observed (data not shown). In contrast, both 
the ACP domains (ACP2(0), eryLDD(0)) were able to channel the diketide substrate 
to both (5)M2+TE and (5)M6+TE, despite the absence of matched linker pairs. 
[00211] NovH(0) is an adenylation-peptidyl carrier protein (A-PCP) didomain 
involved in the biosynthesis of the coumarin ring of novobiocin (Chen, et al, (2001) 
Chem Biol 74, 1-12). This protein has no apparent C-terminal linker region as 
determined by sequence alignment and does not naturally interact with any known 
PKS domain in its role in novobiocin biosynthesis. In our assays, NovH(0) was not 
able to transfer the diketide substrate to either (5)M2+TE or (5)M6+TE without the 
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benefit of linker interactions. However, interaction between the NRPS-derived donor 
protein and PKS modules could be induced by engineering the C-terminal linker from 
DEBS module 4 on to the C-tenninal end of NovH to create NovH(4). With the 
benefit of matched linker pairs, NovH(4) was able to channel the diketide substrate to 
module 2 with a keoiM value of 0. 1 6 min'' and a kca/Knf value of 3.5 min'mM'' and to 
module 6 with a A^eonw value of 0.53 min"^ and a ^^a/^i^A/ value of 8.7 min'^mM'^ As 
the first demonstration of engineered interface involving the interaction of an NRPS 
domain that does not naturally interact with any PKS domains and a PKS domain tliat 
does not naturally interact with any NRPS domains, the experiment illustrates the 
power and utility of tibie linker regions for engineering artificial inteipolypeptide 
junctions. 

[00212] As used herein, the terms "a", "an", and "any" are each rtitended to include 
both the singular and plural forms. 

[00213] Numerous modifications may be made to the foregoing systems without 
departing from the basic teachings thereof. Although the present invention has been 
described in substantial detail with reference to one or more specific embodiments, 
those of skill in the art will recognize that changes may be made to the embodiments 

specifically disclosed in this application, yet these modifications and improvements 
are within the scope and spirit of the hivention, as set forth in the specification, 
drawings, and claims. All publications or patent documents cited in this specification 
are incorporated herein by reference as if each such publication or document was 
specifically and individually indicated to be incorporated herein by reference. 
[00214] Citation of the above publications or documents is not intended as an 
admission that any of the foregoing is pertinent prior art, not does it constitute any 
admission as to the contents or date of these pubUcations or documents. 
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We claim: 

1. A method to prepare a hybrid modular polyketide synthase (PKS) from 
individual modules which method comprises 

providing at least a first naturally occurring extender module comprising an 
ACP domain and a second naturally occurring extender module comprising a KS 
domain which is downstream of the ACP domain in a naturally occurring PKS, 

wherein the C-terminus of said ACP domain is covalently linked to the N- 
terminus of a naturally occurring intrapolypeptide linker (RAL) or interpolypeptide 
linker (ERL) and the N-terminus of said KS domain is covalently linked to the C- 
tenninus of said RAL or ERL, and 

wherein either said first module or second module is not covalently linked to 
said RAL or ERL in a naturally occurring polyketide synthase. 

2. A method of preparing a polyketide using the hybrid PKS of claim 1, 
comprising the steps of preparing a polyketide intermediate using the first module and 
transferring said intermediate to the second module. 

3. The method of claim 1 , wherein the ACP domain of the first module is from a 
first PKS and the entire second module is from the same PKS. 

4. The method of claim 1 , wherein the entire first module is from a first PKS and 
the KS domain of the second module is from the same PKS. 

5. The method of claim 1, wherein the first and second module each comprise a 
KS; AT; 0, 1, 2, or 3 pketomodifying (pKM) domains; and an ACP domain wherein 
the KS and ACP domains are from a first PKS and the AT and pKM domains are 
from a different PKS. 

6. A polyketide synthase prepared by the method of claim 1 . 
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7. The PKS of claim 6, wherein said RAL is selected from the group consisting 
of M2 ery, M4 ery, M6 ery, M2 rif, M3 rif, M5 rif, M3 rap, M4 rap, and M7 rap 
intrapolypeptide module linkers (SEQ. ID. NO's: 18-26, respectively). 

8. The PKS of claim 6, wherein the ERL is selected from the group consisting of 
M2>ery, M5 ery, M4 rif, Ml rif , M8 rif U9 rif M5 rap, and Mil rap 
interpolypeptide linkers (SEQ. ID. NO's: 27-34, respectively). 

9. The PKS of claim 6, wherein said first module comprises the ACP domain of 
ery module 4 and said second module comprises the KS domain selected from the 
group consisting of ery module 5 and 6. 

10. The PKS of claim 6, wherein said first module comprises the ACP domain of 
ery module 2 and said second module comprises the KS domain selected from the 
group consisting of ery module 3 and 5. 

1 1 . The method of claim 1, wherein the C-terminus of said provided ACP domain 
is linkerless and then is covalently linked to the N-terminus of a naturally occurring 
intrapolypeptide linker (RAL) or interpolypeptide linker (ERL). 

12. A PKS prepared by the method of claim 1 1 . 

13. The PKS of claim 12, wherein said first module comprises the linkerless ACP 
domain of ery module 4 and said second module comprises the KS domain selected 
from the group consisting of ery module 5 and 6. 

14. The PKS of claim 12, wherein said first module comprises the Imkerless ACP 
domain of ery module 2 and said second module comprises the KS domain fi-om ery 

module 6. 

15. The PKS of claim 12, wherein the said first module comprises the linkerless 
ACP domain of ery loading didomain (LDD) and said second module comprises the 
KS domain selected from the group consisting of e7y module 2 and 6. 

16. A method to prepare a hybrid modular polyketide synthase (PKS) from 
individual modules which method comprises 
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providing at least a first naturally occxirring extender module comprising an 
ACP domain and a second naturally occurring extender module comprising a KS 
domain which is not normally downstream of the ACP domain in a naturally 

occurring PKS, 

wherein the C-terminus of said ACP domain is covalently linked to the N- 
terminus of a naturally ofccurring intrapolypeptide linker (RAL) or interpolypeptide 
linker (ERL) and the N-terminus of said KS domain is covalently linked to the C- 
terminus of said RAL or ERL, and 

wherein either said first or second module is not covalently linked to said RAL 
or ERL in a naturally occurring polyketide synthase. 

17. A method of preparing a polyketide using the hybrid PKS of claim 16, 
comprising the steps of preparing a polyketide intermediate using the first module and 
transferring said intermediate to the second module. 

1 8. The method of claim 16, wherein the ACP domain of the first module is firom 
a first PKS and the entire second module is firom the same PKS. 

19. The method of claim 16, wherein the entire first module is from a first PKS 
and the KS domain of the second module is firom the same PKS. 

20. The method of claim 16, wherein the first and second module each comprise a 
KS; AT; 0, 1, 2, or 3 pketomodifying (PKM) domains; and an ACP domain wherein 
the KS and ACP domains are from a first PKS and the AT and pKM domains are 
from a different PKS. 

21 . A PKS prepared by the method of claim 1 6. 

22. The PKS of claim 21, wherein said first module comprises the ACP domain of 
ery module 4 and said second module comprises the KS domain selected from the 
group consisting of ery module 2 and 3. 



79 



wo 03/076581 



PCT/US03/06910 



23. The method of claim 16, wherein the C-tenninus of said provided ACP 
domain is linkerless and then is covalently linked to the N-terminus of a naturally 
occurring intrapolypeptide linker (RAL) or interpolypeptide linker (ERL). 

24. A PKS prepared by the method of claim 23. 

25. The PKS of claim 24, wherein the said first module comprises the linkerless 
ACP domain of ery module 4 and said second module comprises the KS domain from 
ery module 2. 

26. The PKS of claim 24, wherein the said first module comprises the linkerless 
ACP domain of ery module 2 and said second module comprises the KS domain from 
ery module 2. 

27. A method to prepare a hybrid nonribosomal peptide synthetase-modular 
polyketide synthase (NRPS-PKS) from individual modules which method comprises 

providmg at least a first naturally occurring extender module comprising a 
peptidyl carrier protein (PCP) domain from a naturally occurring NRPS and a second 
naturally occurring extender module comprising a KS domain from a PKS, 

whereui the C-terminus of said PCP domain is covalently linked to the N- 
terminus of a naturally occurring intrapolypeptide linker (RAL) or interpolypeptide 
linker (ERL) and the N-terminus of the KS domain is covalently linked to the C- 
terminus of said RAL or ERL, and 

wherein either said first or second module is not covalently linked to said RAL 
or ERL in a naturally occurring NRPS or PKS. 

28. A method of preparing a peptide-polyketide using the hybrid NRPS-PKS of 
claim 27, comprising the steps of preparing a peptide intermediate using the first 
module and transferring said intermediate to the second module. 

29. A hybrid NRPS-PKS prepared by the method of claim 27. 
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30. The hybrid NRPS-PKS of claim 29, wherein said RAL is selected from the 
group consisting of M2 ery, M4 ery, M6 ery. Ml rif, M3 rif, M5 rif, M3 rap, M4 rap, 
and M7 rap intrapolypeptide linkers (SEQ. ID. NO's: 18-26, respectively). 

31. The hybrid NRPS-PKS of claim 29, wherein the ERL is selected from the 
group consisting of M3 ery, M5 ery, M4 rif, M7 rif, M8 rif, M9 rif, M5 rap, and 
Mil rap interpolypeptide linkers (SEQ. ID. NO's: 27-34, respectively). 

32. The hybrid NRPS-PKS of claim 29, wherein said first module comprises the 
PCP domain of NovH and said second module comprises the KS domain selected 
from the group consisting of ery module 2 and 6. 
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(a) INTRA-POLYPEPTIDE LINKER 

M2 ery : GGATGAEQAAPATT . . AP VD 
M4ery: VGDAD. .QAA.VRWGAA-DES 
M6ery: VGAAEAEQA. PALVREVPKDAD 
M2rif: FGSA.A.NR. PAEIGTAAAE 
M3rif : L6 . . ER . PAAPAPVTRDVSD 
M5rif : 6ETVAGAPATPVTTVADAG 
M3rap: . ELFTGENPAPVRGPVSAVGQD 
M4rap: . ELFTGENPAPVRGPVS WGQD 
M7r ap : . ELFTGENPAP VHGP VSA . GQD 

fb^ N-TERMINf AT. TNTER-P OT.YPETIDE T JNTgF.R 

M3ery: VTD SE KVAEYLRR .ATLDLRAAR QRIRE..LES • 

M5ery: MSGDNGM.TE E.KLRRYLKR TVT.ELDSVT ARLRE..VEH RAG ■ 

M4rif ; MSAPNE QIVDAL.R ASLKE N VRLQQENSAL AAAAA 

M7rif : VSASYE KWEAL.R KSLEE V GTLKKRNRQL ADAAG 

MSrif: V.AD EGQLRDYLKR .AIADARDAR TRLRE..VEE QAR 

M9rif: MATD E.KLLKYLKR .VTAELHS.. . .LRKQGARH .AD 

MSrap: MR.. EDQLLDAL.R KSVKB N ARLRKANTSL RAAMD 

Mllrap: M.PEQD KWEYL.R WATAELHTTR AKL EA LAAANT 
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Figure 11 
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Figure 31 A 
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Figure 31 B 
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